Gene TM1040_2418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2418 
Symbol 
ID4076744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2559491 
End bp2560564 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content59% 
IMG OID638007740 
ProductABC transporter related 
Protein accessionYP_614412 
Protein GI99082258 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0403436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.72438 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGA TTACACTGTC GAAACTGCGG CACAGCTACC TTTCAAATCC CAAATCCGAC 
AGCGACTACG CGCTGAAGGA AATCGATCTT GATTGGCAGG ACGGCGGCGC CTACGCGCTG
CTCGGCCCGT CAGGCTGCGG CAAATCCACC CTGCTCAATA TCATTTCGGG CCTTCTGGTG
CCGTCTGAAG GGCAGATCCT CTTTGACGGT CAGGACGTCA CCAAACTGCC CCCAGATCAG
CGCAACATCG CACAGGTCTT TCAGTTCCCG GTCATCTACG ACACCATGAC CGTCTACGAC
AATCTGGCGT TTCCCCTGCG CAATCGTGGC CGGGACGAGG CCACGGTGGA ACAGCGCGTC
ATGGCGATTG CCGAGATGCT CGAAGTCACC GAGATGCTGA ACCAGAAGGC GGCGGGGCTG
TCTCCTGACA ACAAACAGAA GATCTCCATG GGCCGCGGCC TCGTGCGTGA GGACGTGAAC
GTAGTGATGT TCGACGAACC GCTCACGGTG ATCGACCCGC ATCTGAAGTG GAAACTCCGC
TCCAAGCTCA AGGAGCTGCA TCAGCGGGTC AAAGCGACCA TGATCTACGT CACCCACGAC
CAGACAGAGG CGCTGACTTT TGCGGATCAG GTGGTCGTAA TGCAGCTGGG TGAAGTGGTG
CAGATCGGCA CGCCGGTTGA GCTCTTCGAA CGCCCTGCAC ATACCTTTGT GGGTCACTTC
ATCGGCTCTC CGGGCATGAA CATCATTCCC TGCAGTTATG ACGGCGCGGC GAAGGTCGAA
GGTCACGACA TTGTGCTGGA AGGGCCGGTG CGCGGCACCC CCAATGGTCA GACCGAAATC
GGCATCCGTC CGGAGTTTGT GTCTCTCTCC GACAGCGGCC TGCCCGCGAC TGTGACCAAA
GTCTCGGACG TAGGGCGGCA CACGGTCGTT GAATGCGACT GCCTTGGTCA CAAGGTGAAT
GCCGTGATCG AAGAGGGCGC AGCACCTGAA AAAGGGGCGC AAACCCACCT CGCCTTCCGC
CAAGACCAGA CCCGCCTTTA TGTGGATGGC TGGCTCGCCA CTGATCCGGA GTAA
 
Protein sequence
MAKITLSKLR HSYLSNPKSD SDYALKEIDL DWQDGGAYAL LGPSGCGKST LLNIISGLLV 
PSEGQILFDG QDVTKLPPDQ RNIAQVFQFP VIYDTMTVYD NLAFPLRNRG RDEATVEQRV
MAIAEMLEVT EMLNQKAAGL SPDNKQKISM GRGLVREDVN VVMFDEPLTV IDPHLKWKLR
SKLKELHQRV KATMIYVTHD QTEALTFADQ VVVMQLGEVV QIGTPVELFE RPAHTFVGHF
IGSPGMNIIP CSYDGAAKVE GHDIVLEGPV RGTPNGQTEI GIRPEFVSLS DSGLPATVTK
VSDVGRHTVV ECDCLGHKVN AVIEEGAAPE KGAQTHLAFR QDQTRLYVDG WLATDPE