Gene TM1040_0849 details


Gene Information

Locus tag: TM1040_0849
Symbol:
ID: 4076024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 901433
End bp: 902737
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 61%
IMG OID: 638006147
Product: extracellular solute-binding protein
Protein accession: YP_612844
Protein GI: 99080690
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
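
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(901433, 902737)
print(length)                           # 1305
print(expected_protein_length(length))  # 434
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.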


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0177782
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGTTT CAAAAATTGC ACTGAGCTGC GCACTGGCGA CCGCTCTGAC CGCCGGGGCC 
GCCTGGGCCG AGACCGAAAT CCAGTGGTGG CACGCCATGG GCGGCGCCAA TGGCGAGCGC
ATCGACAAGA TGGCGGCAGA CTTCAACGCC AGCCAGTCCG AGTATAAAAT CGTGCCCACC
TACAAGGGCA ACTACACTGA AACCATGACC GCCGCCGTGG CCGCGTTCCG CGCGGGTGAG
CAGCCGCACC TTGTACAGGT GTTCGAAGTG GGCACCGCCA CCATGATGGC TGCCAAAGGG
GCGATCTACC CCATCGAGCA GATGATGTCC GATGCGGGCG AAGCCTTTGA CAAATCCGAC
TATCTGCCCG CGGTGATTTC TTATTACCAG ACCCCCGAGG GGGAACTGCT GTCGATGCCG
TTCAACAGCT CGACACCGGT TCTGTGGTAC AATGCCGATG CCTTCAAATC CGCAGGCGTC
GATGTCCCGG AAACCTGGGA TGACGTGAAA TCCGCTGCTC AGGCGCTGGT CGACAACGGC
ATGGAGTGCG GCCTGTCCTT CGGTTGGCAG TCCTGGGTGA TGGTTGAGAA CTTCTCGGCT
TGGCACAACA TCGAGATGGG CACCAAGGAA AACGGCTTTG CCGGGTTCGA CACCGAGTTC
ACCTTCAACA ACGAGCAGGT TGCGGCCCGC CTCGAGGACA TCGCCTCCAT GAGCGAGGGC
AACCTCTTCA AATATGGCGG TCGTCGCGGC GACAGCCTGC CGCTGTTCAC CAACGGTGAA
TGCGGGATGT GGATGAATTC CTCGGCCTAT TACGGCTCCA TGGTCGAGCA GGCAGAGTTC
GAATTCGGCC AGACCATGCT GCCGCTCGAC ACCTCGGTTG CGGACGCGCC TCAGAACTCC
ATCATCGGCG GTGCGACCCT CTGGGCGCTG GCCGGTCACG AGGCCGAGGA ATACAAGGGT
CTGGCGCAGT TCATGACCTA TCTTTCCTCG CCCGAAGTTC AGGCATGGTG GCACCAGGAA
ACCGGCTATG TGCCGATCAC CACTGCCGCG TATGAGCTGA GCAAGGAGCA GGGTTTCTAT
GACGAAAACC CCGGCACCGA CACCGCGATC AAGCAGCTGA GCCTGAACGC GCCGACGCCG
AACTCCCGCG GGATCCGCTT TGGCAACTTC GTGCAGGTGC GTGACGTGAT CAACGAAGAG
CTCGAAGCGC TCTGGGCTGG TGACAAGACC GCCTCCGAAG CCCTCGATGC CGCCGTTGAG
CGTGGTAACG CGCTGCTGCG CAAATTCGAG CGCTCCGCGA AGTAA
 
Protein sequence
MGVSKIALSC ALATALTAGA AWAETEIQWW HAMGGANGER IDKMAADFNA SQSEYKIVPT 
YKGNYTETMT AAVAAFRAGE QPHLVQVFEV GTATMMAAKG AIYPIEQMMS DAGEAFDKSD
YLPAVISYYQ TPEGELLSMP FNSSTPVLWY NADAFKSAGV DVPETWDDVK SAAQALVDNG
MECGLSFGWQ SWVMVENFSA WHNIEMGTKE NGFAGFDTEF TFNNEQVAAR LEDIASMSEG
NLFKYGGRRG DSLPLFTNGE CGMWMNSSAY YGSMVEQAEF EFGQTMLPLD TSVADAPQNS
IIGGATLWAL AGHEAEEYKG LAQFMTYLSS PEVQAWWHQE TGYVPITTAA YELSKEQGFY
DENPGTDTAI KQLSLNAPTP NSRGIRFGNF VQVRDVINEE LEALWAGDKT ASEALDAAVE
RGNALLRKFE RSAK
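
The two sequence blocks can be related to each other, and to the GC content field, with a short script. This is a sketch rather than the database's own method: it assumes the standard codon assignments (which translation table 11 shares with the standard code for elongation codons), and the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical. Only the first 30 nucleotides of the gene sequence are used in the demonstration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above, in the page's blocked layout.
fragment = clean_sequence("ATGGGTGTTT CAAAAATTGC ACTGAGCTGC")
print(translate(fragment))  # MGVSKIALSC
```

The translated fragment matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field.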