Gene TM1040_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3601 
Symbol 
ID4075028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp650134 
End bp651030 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content61% 
IMG OID638005120 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_611830 
Protein GI99078572 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGACG CAACTGATCC TCTCTATCAA GAAATCCAAG GGCCGACGCC GCGCCAGATG 
CTGAAGACGC GCGCGCTTGG TCACAAGGGG CTAATCTTTG GCCTCTGTGT GGTTGGCCTC
TTGGTCGTGA TCGCCCTCCT CGCCCCGGTT CTGGCGCCAC ATAATCCCTA TGAGCAAAGC
CTGATGAACC GCATGGTACC CCCTGTTTTT CTGGGCGGCA CATGGGAGCA TCCCCTCGGC
ACCGACCATC TCGGGCGGGA TTACCTGTCA CGCCTGATCT ACGGCGCGCG GGTTTCGCTT
CTGATCGGCG CAGTCGCTGC GCTGATTTCT GGCGTGATCG GCACCGCCAT GGGGGTTGCA
GCCGGGTATT TCGGCGGCAA GGTCGACGCG GTTGTGACCT TCCTCATAAA CGTGCGCCTG
GCGATGCCCG TGGTCCTGGT GGCGCTGGCT GTCGTGGCAA TCCTTGGCGG GTCTTTGACG
GTCGTGGTCT GCGTGCTCGG CCTATTGTTG TGGGACCGAT TTGCCGTGGT GATGCGGGCG
TCGACCTTGC AGGTCAGCCG TCGCGACTAC GTGGCCGCCG CTCAGGTGAT CGGAGCCTCG
ACCCCGCGCA TCCTCTTGAC CGAGATCATG CCAAATATCT TCAACAACCT GATCGTGGTG
ATCACGCTGG AGATGGCCCA TGCGATCCTG CTCGAAGCGG CACTGAGCTT CCTTGGTCTA
GGCGTGCAAC CGCCGACCCC TTCGTGGGGT CTGATGGTGA GCGAAGGCAA AAACATGATG
TTGTTTGAGC CTTGGCTGGT TCTCATTCCC GGCGCCGTTT TGTTCCTGCT TGTGCTGGCA
ATCAATCTCA TGGGCGATGG TCTGCGCGAC GTCACCGCCC CCGAAGGACG GAGCTGA
 
Protein sequence
MTDATDPLYQ EIQGPTPRQM LKTRALGHKG LIFGLCVVGL LVVIALLAPV LAPHNPYEQS 
LMNRMVPPVF LGGTWEHPLG TDHLGRDYLS RLIYGARVSL LIGAVAALIS GVIGTAMGVA
AGYFGGKVDA VVTFLINVRL AMPVVLVALA VVAILGGSLT VVVCVLGLLL WDRFAVVMRA
STLQVSRRDY VAAAQVIGAS TPRILLTEIM PNIFNNLIVV ITLEMAHAIL LEAALSFLGL
GVQPPTPSWG LMVSEGKNMM LFEPWLVLIP GAVLFLLVLA INLMGDGLRD VTAPEGRS