Gene TM1040_0338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0338 
Symbol 
ID4076039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp342242 
End bp343732 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content60% 
IMG OID638005633 
ProductNCS1 nucleoside transporter 
Protein accessionYP_612333 
Protein GI99080179 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACA TAACAAGCGG TTATGCCGCC TCAGAGGGTG CCGCGCACGC GCAGCTGGAC 
CGTGCGCGCC TGGACCCCGA ACTCTATAAC GAGGATCAAC TGCCCACCAC CGCCGCCGAG
CGCACATGGA ATTGGCTGTC GATTTCGGCG CTTTGGGTCG GGATGGTGGT CTGTATCCCA
ACGTATCTTC TGGCGTCCTA TCTGATTGGC GCGGGAATGA GCTGGGATCA GGCGGTTCTG
ACCATTCTGG CGGCCAATGC CATCGTGCTC ATCCCGATGG TGCTGGTGGG CCATGCCGGC
ACCAAATACG GCATCCCGTT TCCGGTGCTG TTGCGCGCGT CCTTTGGTCC CGTGGGGGCC
AAGATCCCGG CTGTTGCCCG TGGGATCGTG GCCTGCGGCT GGTTTGGCAT CCAGACATGG
GTTGGCGGTT CTGCGATCTT TGTGATCGTC AACAAGCTCA CCGGAGGCGC GCTCGCGGCG
GAGGCGCTGC CGCTTCTGGG GATCAGCCTT GGCGAATTTG TCTGCTTCTT GGCGTTCTGG
GGCCTGCATC TCTACTTCAT CAAGAACGGC ACCGAGTCGA TCCGCTGGCT TGAGACCTAT
GCCGCGCCGT TCCTGCTGGC GATGGGGCTT GCGCTCCTGG CATGGGCCTA TTCTGCGGCG
GGCGGATTTG GCGAGATGCT CTCCACCCCC AGCGCCTTTG ATGTGGGCCA GCCCAAAGAG
GGGCAGTTCT GGGCGGTGTT CTGGCCAAGC CTGACGGGTA TGATTGGCTA TTGGGCGACG
CTTGCGCTCA ATATTCCGGA TTTCACCCGT CACGCCCGCA GCCAGAAAGA TCAGTTGGTG
GGGCAGATGG TGGGTCTGCC GATCCCGATG GCGTTGTTTG CCTTTATCGC CTCTGCTGTG
ACCTCGGCCA CGGTTGTGAT CTTTGGTGAG GCGATCTGGG ATCCGGTGCA GCTCTCTGAA
CGCATGGGCG GCTCTGCGGT GATCATCGCG CTTTTTGCGC TGATCGTGGC GACGCTGACG
ACAAACCTTG CGGCCAATGT GGTGGCCCCG GCGCATGGGT TTGCCAATCT TGCGCCAAGC
AAGATCAACC TTAAGCGTGG CGGCTATATC ACGGCAGCCA TCGGGATCGC GATGTTCCCG
TGGATTCTGG TGAACCACAT CGTTGGGTGG CTGATTGCCT ATTCCGCGCT TTTGGGGCCG
ATCGCGGGCG TGATGCTGGC GGATTATTAC CTCTTGCGCA AAACCCGGCT CGAGGTTGCG
GATCTCTTTA AATCAAACGG CATCTATGCG GGGCACAATG GCACCAACTG GGCCGGTGTC
CTGGCGCTGG TCATCGGCAT TCTGCCAAAT CTGCCGGGCT TTCTGGCCGG GGTGGGGCTT
ACGGATGGGA CCTCGCCGTT CTTTGCGATG ATCTATACCT ACGCCTGGTT TGTGGGGCTC
TTTGTTGCGG GCGCGGCCTA TTTGCTGCTC TCCAAAATGG TTAACAAATA A
 
Protein sequence
MTDITSGYAA SEGAAHAQLD RARLDPELYN EDQLPTTAAE RTWNWLSISA LWVGMVVCIP 
TYLLASYLIG AGMSWDQAVL TILAANAIVL IPMVLVGHAG TKYGIPFPVL LRASFGPVGA
KIPAVARGIV ACGWFGIQTW VGGSAIFVIV NKLTGGALAA EALPLLGISL GEFVCFLAFW
GLHLYFIKNG TESIRWLETY AAPFLLAMGL ALLAWAYSAA GGFGEMLSTP SAFDVGQPKE
GQFWAVFWPS LTGMIGYWAT LALNIPDFTR HARSQKDQLV GQMVGLPIPM ALFAFIASAV
TSATVVIFGE AIWDPVQLSE RMGGSAVIIA LFALIVATLT TNLAANVVAP AHGFANLAPS
KINLKRGGYI TAAIGIAMFP WILVNHIVGW LIAYSALLGP IAGVMLADYY LLRKTRLEVA
DLFKSNGIYA GHNGTNWAGV LALVIGILPN LPGFLAGVGL TDGTSPFFAM IYTYAWFVGL
FVAGAAYLLL SKMVNK