Gene TM1040_3483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3483 
Symbol 
ID4075123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp514420 
End bp515673 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content60% 
IMG OID638004998 
Productextracellular solute-binding protein 
Protein accessionYP_611717 
Protein GI99078459 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTGC TGAAGGGAAC GGCCGCAGGA CTTGCGATGG CCGTCGGGCT GGCCGCATCT 
GCTCAGGCAT CGGAGCTGAC CGGAACGCTC AAGATTTTCT CGGATATGTC GAACCCGGCA
CCGCGCGCGG TGATGGAGAA GATGGCGTCC GATTTTGATG CACTGCATCC CAATCTGAAA
GTCGAACTGA CCGTCATCGA CCGCGAAGCC TACAAGACCC AGATTCGCAA CTTCCTGACC
GCCAATGCGC CGGATGTTGC CAACTGGTAC GCTGCCAACC GCATGCGCCC CTATGTGTCG
GCCGGTCTCT TTGAGGATGT CTCTGACCTC TGGGCAGAGC CTGCGATTGC GGAAAACCTT
GCGTCCACCA AGGGCGCGAT GACGCTTGAT GGCAAGCAGT GGGGCGTGCC CTATACCTAC
TATCAGTGGG GCGTCTACTA CCGCGAGGAC ATCTACAACG AACTGGGTCT CGAAGAGCCA
AGCGACTGGG CAACCTTCAA GTCCAACTGC CAGAAGATTC TCGACTCGGG CCGCAAGTGC
TTCACCATTG GTTCCAAGTT CCTCTGGACC GCCGGCGGCT GGTTTGACTA TCTGAACATG
CGTACCAACG GCTACGACTT CCACATGGCG CTGACCAATG GGGACGTGGA ATGGACCGAT
GACCGAGTGA AGCAAACCTT TGCCAATTGG CGCGAGCTGA TCGACATGGG CGCCTTTATC
GACAACCACC AGTCCTACAG CTGGCAGGAG GCGCTGCCCT TCATGGTGAA TGGTGAAGCG
GCGGCCTACC TCATGGGGAA CTTTTCCGTG GCCCCGCTGC GCGAAGCGGG TCTGAGCGAC
GAGCAACTTG ATTTCTACCA GTTCCCGGCG ATCAACCCGG ATGTCGAGCT GGCCGAAGAT
GCGCCGACCG ATACGTTCCA CATCCCGTCC GGGGCCCAGA ACAAGGAAGC GGCGCGTGAG
TTCCTGCGCT ATGTGGTCTC TGCGGACGTG CAGACCGCGA TCAATGCGGG CGACGCACTT
GGGCAGCTGC CGGTCAATGC CTCTTCCTCG GTGGATGATG ACGAGATGCT GAACCAGGGC
TTCGAGATGC TCTCCTCCAA CAGCCCCGGC GGTATCGCGC AGTTCTTTGA TCGCGACGCC
CCGGCCGAGA TGGCCTCGGT GGCGATGGAA GGCTTCCAGG AGTTCATGGT GTTCCCCGAC
AATCTCGACG ACATCCTGAA CCGTCTCGAG AAGGCCCGTC AGCGGATCTA CTAA
 
Protein sequence
MNLLKGTAAG LAMAVGLAAS AQASELTGTL KIFSDMSNPA PRAVMEKMAS DFDALHPNLK 
VELTVIDREA YKTQIRNFLT ANAPDVANWY AANRMRPYVS AGLFEDVSDL WAEPAIAENL
ASTKGAMTLD GKQWGVPYTY YQWGVYYRED IYNELGLEEP SDWATFKSNC QKILDSGRKC
FTIGSKFLWT AGGWFDYLNM RTNGYDFHMA LTNGDVEWTD DRVKQTFANW RELIDMGAFI
DNHQSYSWQE ALPFMVNGEA AAYLMGNFSV APLREAGLSD EQLDFYQFPA INPDVELAED
APTDTFHIPS GAQNKEAARE FLRYVVSADV QTAINAGDAL GQLPVNASSS VDDDEMLNQG
FEMLSSNSPG GIAQFFDRDA PAEMASVAME GFQEFMVFPD NLDDILNRLE KARQRIY