Gene TM1040_2417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2417 
Symbol 
ID4076743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2558405 
End bp2559481 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content61% 
IMG OID638007739 
ProductABC transporter related 
Protein accessionYP_614411 
Protein GI99082257 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0208356 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.753894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTCG AACTCAGATC CGTGACCAAA CGCGTGGGCG GTGATCTCCA TATCAAGGAG 
ACCTCCCTGA CGCTCGAGCC CGGTCACTTC AACGTCCTTC TGGGCGCCAC CGGGTCGGGA
AAGACCTCTC TTATCAAGAT GATGGCCGGG CTTGACCCGA TTGCCTCCGG CTCTGTCCTC
ATGGACGGGC AGGACGTGAC CCGCCTCAAC ACACAAAAGC GCAACATCAG CCTTGTGCAT
CAGTTTTTCA TCAACTACCC GCACATGACG GTCTACGACA ATATCGCCTC GCCACTCAAA
GTTGCGGGCA TGGCAAAGTC GGAACTTGAT GACCGCGTGC AGGAAGCGGC GAAAATTCTG
CAGCTCACCC CAATGTTGCA TCGCCGCCCG CACGAGCTCT CTGGCGGTCA GCAGCAGCGG
ACCGCGCTGG CGCGTGCGAT TGCAAAGGAA AGCCGCGCTG TCTTCCTCGA CGAGCCGCTG
GCGAACCTCG ACTATAAGCT GCGCGAGGAA TTGCGCGATC AGCTGCCGGA GCTCTTTGCC
GGTCGTGGCG CGGTTGTGGT CTATGCCACC TCTGAGCCCG AAGAGGCGCT CCTTCTTGGC
GGCAAGACAG CACTCATGCG CGATGGCCGC GTGACCCAAT TCGGCCCCAC CGCAGAGATC
TATCGCAATC CTGAAAACGT CGAAGCCGCA CGCGTGTTCT CCGACCCGCC GATCAACACG
GCGACAATCA CCAAACAAGG CTTTGAGGCG CGTTTGGGGC CGGATGTGCG CTGGACCCTG
GATGGCGCGG CTGCCAGCCT GAAGGACGGC ACCTACACCA TCGCAATCCG CCCGCATCAT
GTCACCCCGG TGGCATCCTC GGCAGGACTG GTAAGACTCA ACGGTCGCGT GCAGGTGACA
GAGCTATCCG GTTCCGAAAG CTCGGCGCAT TTCGATCTTG CGGCCTCCGG GCAGGAAACC
TCCTGGGTGT CCCTGAGCCA CGGCGTCCAC CCCTACGAGG TTGGCGAATT GCATGATTTC
TATATGGACC CGCGGGCGGC ATATGTCTTT GCCCCTGACG GCTCCCGCGT GGCGTGA
 
Protein sequence
MALELRSVTK RVGGDLHIKE TSLTLEPGHF NVLLGATGSG KTSLIKMMAG LDPIASGSVL 
MDGQDVTRLN TQKRNISLVH QFFINYPHMT VYDNIASPLK VAGMAKSELD DRVQEAAKIL
QLTPMLHRRP HELSGGQQQR TALARAIAKE SRAVFLDEPL ANLDYKLREE LRDQLPELFA
GRGAVVVYAT SEPEEALLLG GKTALMRDGR VTQFGPTAEI YRNPENVEAA RVFSDPPINT
ATITKQGFEA RLGPDVRWTL DGAAASLKDG TYTIAIRPHH VTPVASSAGL VRLNGRVQVT
ELSGSESSAH FDLAASGQET SWVSLSHGVH PYEVGELHDF YMDPRAAYVF APDGSRVA