Gene TM1040_1188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1188 
Symbol 
ID4077797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1277641 
End bp1279176 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content58% 
IMG OID638006494 
ProductABC transporter related 
Protein accessionYP_613183 
Protein GI99081029 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.909394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGGCG ACGTGATCCT GCAAATTTCG AACCTCACGA AGTCCTTTGG GCCGGTAAAG 
GCGCTAAAGG GCGTGGATTT CGAACTCCGC CGTGGCGAGA TCCACGCAAT CGCCGGAGAG
AACGGCGCCG GGAAGTCCAC GTTGATGAAC ATCATCGACG GTATTTTGCA GCCCGACAGC
GGTGAAATCC GGCTCGATGG CACCCCTGTC GAAATTCCGT CTCCGGCAGC GGCGCAAAAG
ATGGGGATCG GCTTTGTGCA TCAGGAGATC GCACTCTGTC CGGATGTCTC GGTCGCTGAA
AACATCTTTA TGGCTGCGAC CAATTCAAGC CGGTCTTTTC TGATGGATTA CAAAGGGATC
GAAGCCAAAG CGCGCGAAGT TTTTGCGCAG TTGTCGAGCA TTGATCCCTC TGTGCTCGTC
CGGGACCTTT CGATTTCGAA CCAGCAGCTT GTGGAGATCG CCAAGGCGCT GACGCTGGAC
TGCCGGGTGT TGATCCTGGA TGAGCCAACG GCAGCGCTGA CCGAGGCGGA GGCGCAGGTG
CTGTTCAAGA TCATGCGGCG GCTGGCGGAT CAGGGGATTT CGTTGATTTA TATCTCGCAT
CGTATGGTCG AGATCTTTGA CAATTGCGAC CGGGTCTCGG TGTTTCGCGA TGGCCGCTAT
GTGACGACAC AGGATGTCGC CAAAATCACG CCTGCAGATG TGGTCCGGGC CATGGTGGGG
CGCGAAATTG GCGATCTCTA TCCCGAAAAA CAACGTCCCG AGGCCTGCAG CACGACCGAG
GTCCTCCGGG TTGAAAACCT TTGTGAGGCC GAGCGGTTTC ACGATGTGTC CTTTTCGCTG
CATAAGGGCG AGATCCTGGG CTTTGCCGGG TTGATCGGTG CAGGGCGCAG CGAGATCGCC
AAAGGGGTCT GCGCGCTTGA AGGTCAGGTG ACTGGCAGGC TCTGGCTCAA TGGAGAGCCG
CTGGCGCTGC GCGACTATCA GGACAGCATT GATGCGGGGA TTGTGTATCT CTCCGAGGAT
CGCAAAGGCG ACGGCGTGTT TCTGGATATG TCCATCGCCA GCAATGTTTC GGCGCTGAAG
GTCGAACAGG TGGCCAGCGC CCTCGGGTTG ATCCAGCCTG GCAGGGAAAT AGAGCAGGCG
GACCGACTTG GGCGCAAGCT CAACCTCAAA TGCGGCACGC TGCAGGATCC GGTTTCGTCG
CTTTCGGGCG GCAATCAACA AAAGGTAGCA CTGGCCAAGA TGCTCTCGGT CAACCCGCGA
CTGATCTTTC TGGATGAGCC GACGCGTGGC GTTGACGTTG GCGCAAAGGC CGAAATCTAC
CGCATCCTGC GGGATTTGGC CGAGGAGGGC GCAGGCATTG TGGTTATTTC ATCCGAGCTG
CCAGAACTGA TCGGTCTGTG TGATCGCGTG CTGGTGATCC ACGAAGGGTG TCTGAGCGGT
GAAGTGAGTG GCCCGGACAT GACAGAAGAA AACATCATGC ACCTTGCCTC GGGCACTCAG
CAAGGCACGG GTGCAGCGTC GGTTGCGGCG CAATGA
 
Protein sequence
MTGDVILQIS NLTKSFGPVK ALKGVDFELR RGEIHAIAGE NGAGKSTLMN IIDGILQPDS 
GEIRLDGTPV EIPSPAAAQK MGIGFVHQEI ALCPDVSVAE NIFMAATNSS RSFLMDYKGI
EAKAREVFAQ LSSIDPSVLV RDLSISNQQL VEIAKALTLD CRVLILDEPT AALTEAEAQV
LFKIMRRLAD QGISLIYISH RMVEIFDNCD RVSVFRDGRY VTTQDVAKIT PADVVRAMVG
REIGDLYPEK QRPEACSTTE VLRVENLCEA ERFHDVSFSL HKGEILGFAG LIGAGRSEIA
KGVCALEGQV TGRLWLNGEP LALRDYQDSI DAGIVYLSED RKGDGVFLDM SIASNVSALK
VEQVASALGL IQPGREIEQA DRLGRKLNLK CGTLQDPVSS LSGGNQQKVA LAKMLSVNPR
LIFLDEPTRG VDVGAKAEIY RILRDLAEEG AGIVVISSEL PELIGLCDRV LVIHEGCLSG
EVSGPDMTEE NIMHLASGTQ QGTGAASVAA Q