Gene TM1040_2416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2416 
Symbol 
ID4076742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2556474 
End bp2558207 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content60% 
IMG OID638007738 
Productextracellular solute-binding protein 
Protein accessionYP_614410 
Protein GI99082256 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000012875 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAAGT ATCTTTTGAC CGCAGTTGCA GCCGGGGCTG TAATTGCGGC AACGGGTTCG 
TATGCCGATG AGGCAGCGGC CCAGAAATGG ATCGACGAGG AGTTTCAGCC GTCGGTTCTG
AGCAAAGCAG AACAACTTGC CGAGATGCAG TGGTTCATCA ACGCCGCCGA GCCTTACAAG
GGCATGGAGA TCAACGTACT GTCCGAGGGC ATCCCCACGC ACAGCTACGA ATCCGAGGTG
CTGACCAAGG CGTTTGAGGA AATCACCGGC ATCAAGGTGA ACCACCAGAT CCTGGGCGAA
GGCGAGGTCG TTCAGGCCGT GCAGACCCAG ATGCAGACCA AGCGGAACCT CTATGACGCA
TACGTCAACG ACTCCGACCT GATCGGCACG CACTCGCGCC TGCAGCTCGC TTACAACCTG
AGCGACATGA TGGAAGGCGA CTTCAAGGAT GTGACCAACC CCGGTCTCGA CCTTGACGAT
TTCATGGGCA CCCAGTTCAC CACTGGCCCC GATGGCGACC TCTACCAGCT GCCCGACCAG
CAGTTTGCGA ACCTCTACTG GTTCCGCAAA GATTGGTTCG ACCGCGAAGA TCTGAAGGCC
GCCTTCAAAG AGAAATACGG CTACGAGCTG GGTGTTCCGG TCAACTGGTC CGCCTATGAA
GACATTGCCG AGTTCTTCTC TGAAGATGTG AAAGAAATCG ACGGCACCAC CATCTACGGC
CACATGGATT ACGGCAAACG CGCGCCTGAC CTCGGCTGGC GGATGACCGA TGCGTGGCTC
TCCATGGCCG GTGCGGGCTC CAAGGGTGAG CCGAACGGTG TTCCGATCGA CGAATGGGGC
ATCCGTATGG AAGAAGGCAC CTGTAACCCG GTGGGCGCAA GCGTCACCCG CGGCGGTGCT
GCAAACGGTC CGGCAGCAGT CTATGCGATC CGCAAGTGGG ACGAATGGCT GCGCAAATAC
GCACCTCCCG GTGCCGCGTC TTATGACTTC TACCAGTCTC TGCCCGCACT CGCTCAAGGC
AACGTCGCGC AGCAGATCTT CTGGTACACC GCCTTTACCG CAGACATGGT GAAGCCGAAG
TCCGAAGGCA ACAACACCGT CGACGACAGC GGCACCCCGC TGTGGCGCAT GGCACCGAGC
CCGCATGGCC CCTACTGGGA AGAAGGCCAG AAGGTTGGCT ATCAGGACGT GGGCTCCTGG
ACCTTCCTCA ACTCCACCCC GCTGGACCGC GCACAAGCCG CATGGCTCTA TGCTCAGTTC
GTCGTCTCCA AGACCGTCGA CGTGAAGAAG TCCCACGTGG GTCTGACCTT CATTCGCGAC
AGCTCCGTCA ACCACGAGAG CTTCACCGAG CGTGCGCCCA AACTGGGTGG TCTGGTGGAA
TTCTACCGTT CGCCCGACCG GACTGCATGG TCCCCGACCG GCATCAACGT GCCTGACTAT
CCCAAGCTGG CGCAGATCTG GTGGCAGCAG ATTGGTGACG TGAACTCCGG TGCCTTCACC
CCGCAAGAAG CGATGGATCG TCTGGCGCAG GAAATGGACA TCACCATGGG TCGTATGCAG
CGTGCAGACG AGCAGGCGAA TGTCTATGGC GGCTGCGGCC CGCGTCTGAA CGAAGAAAAA
GACGCGGAGT GGTGGTACGC CAATGGCGGC GCCAAGCCGA AGCTGGAGAA CGAAAAGCCG
CAAGGCCAGA CCGTCAACTA TGACGAGCTG GTGGCGCGCT GGGCCGCGAA CTGA
 
Protein sequence
MRKYLLTAVA AGAVIAATGS YADEAAAQKW IDEEFQPSVL SKAEQLAEMQ WFINAAEPYK 
GMEINVLSEG IPTHSYESEV LTKAFEEITG IKVNHQILGE GEVVQAVQTQ MQTKRNLYDA
YVNDSDLIGT HSRLQLAYNL SDMMEGDFKD VTNPGLDLDD FMGTQFTTGP DGDLYQLPDQ
QFANLYWFRK DWFDREDLKA AFKEKYGYEL GVPVNWSAYE DIAEFFSEDV KEIDGTTIYG
HMDYGKRAPD LGWRMTDAWL SMAGAGSKGE PNGVPIDEWG IRMEEGTCNP VGASVTRGGA
ANGPAAVYAI RKWDEWLRKY APPGAASYDF YQSLPALAQG NVAQQIFWYT AFTADMVKPK
SEGNNTVDDS GTPLWRMAPS PHGPYWEEGQ KVGYQDVGSW TFLNSTPLDR AQAAWLYAQF
VVSKTVDVKK SHVGLTFIRD SSVNHESFTE RAPKLGGLVE FYRSPDRTAW SPTGINVPDY
PKLAQIWWQQ IGDVNSGAFT PQEAMDRLAQ EMDITMGRMQ RADEQANVYG GCGPRLNEEK
DAEWWYANGG AKPKLENEKP QGQTVNYDEL VARWAAN