Gene TM1040_3217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3217 
Symbol 
ID4075321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp212764 
End bp213834 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content62% 
IMG OID638004726 
ProductABC transporter related 
Protein accessionYP_611453 
Protein GI99078195 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.257824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCGA CTGCACAGGG GCAGGCCATC GCACTCGAGC AGGTGCGCAA GGTTTGGGGC 
GAGACTGTCT TGCTCGAAGG TCTCAGCCTT GACGTTCCTG CGGGATCATT CACTGCGGTG
CTGGGTCCGT CGGGATGTGG CAAATCCACG ACGCTTCGCA TCATAGCAGG CCTTGAGGCG
GTGACGACGG GGCAGGTTCG GATCGGGAAC CGCGATGTGA CCCAGATGAG CCCTGCGCAG
CGTGGCATTT CCATGGTGTT TCAGTCCTAT GCCTTGTTTC CCCACCTTAC CGTTGCGGAA
AACATTGTTT TTGGTCTGAA AGTTGCAGGG CTGCCGCGAC GCGAGCGTCA GCGCCGTCTG
AAGGATGTGG CCGACCTTCT CGAACTTGGC CCCTATCTCG GACGTAAACC TGCGGCGCTT
TCAGGAGGCC AGCAGCAACG GGTGGCCCTG GGGCGGGCTG TGATCTCGCA GCGCCCGGTT
TGCCTGATGG ATGAGCCTCT CTCCAATCTG GACGCCCGAT TGCGCGACGA GATGCGGCGT
GAAATCCGCC GTCTGCAGCT CGCGCTCGGT TTCACGATGG TCTATGTGAC CCACGATCAG
ACCGAGGCGA TCACCATGGC TGATCGCGTG GTCTTGATGA ACAAAGGCCA GATTGAGCAA
GTCGCCGCCC CGGAAGAGAT CTACAACCGT CCCGCGACGC CCTTTGCGGC GCGGTTTATC
GGTAACCCGC CGATGAATCT TCTGTCTCCT GCAGCCTTTG GGGCGGCGCT TGAGGCCGCG
CCCGAGAGTC TTCGGATCGG GGTGCGACCC GAAGCACTCA CGGAATCCGC CCAAGGCCCG
ATCCTCGCGG AGGTCAAAGG TGCGGAATTC CTCGGCGCGG ACACTCTGGT AGAGCTGACC
TGCGCAGGCG GCGACATTTT GGCCAAGCTG CCCGGCACAC GCGGTCTTAC CCCCGGCGAG
ACGCTGAGAT TGGCGGTTCC GCCCGAGCAG ATCCATGCCT TTGACCTGAC ACGGAACGCG
CGTCTGGAAG ATCCAGGCCT GATCGCGCGT CTGCAGGAGA TTCACTCATG A
 
Protein sequence
MTATAQGQAI ALEQVRKVWG ETVLLEGLSL DVPAGSFTAV LGPSGCGKST TLRIIAGLEA 
VTTGQVRIGN RDVTQMSPAQ RGISMVFQSY ALFPHLTVAE NIVFGLKVAG LPRRERQRRL
KDVADLLELG PYLGRKPAAL SGGQQQRVAL GRAVISQRPV CLMDEPLSNL DARLRDEMRR
EIRRLQLALG FTMVYVTHDQ TEAITMADRV VLMNKGQIEQ VAAPEEIYNR PATPFAARFI
GNPPMNLLSP AAFGAALEAA PESLRIGVRP EALTESAQGP ILAEVKGAEF LGADTLVELT
CAGGDILAKL PGTRGLTPGE TLRLAVPPEQ IHAFDLTRNA RLEDPGLIAR LQEIHS