Gene Tmz1t_0498 details


Gene Information

Locus tag: Tmz1t_0498
Symbol:
ID: 7085009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 559939
End bp: 560970
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 68%
IMG OID: 643697527
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_002354169
Protein GI: 217968935
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
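
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(559939, 560970)
print(length, protein_length(length))  # prints: 1032 343
```

Under those assumptions the result agrees with the Gene Length (1032 bp) and Protein Length (343 aa) fields of this record.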


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.962606
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGAAAC GCCGCTTCAC CGCCCTCATC GCCGGCCTGT TCGCCTCGAC CGCGCTCGGT 
TTCTCGATGC CGGCCACGGC CCAGCAGTAC AAGGACGAGT ACAAGCTCTC CACGGTGCTC
GGCGAGGCCT TCCCGTGGGG CTGGGGCGCC AAGCGCTGGG CCGACCTGGT CGCAGAGAAG
ACCGAAGGTC GCATCAAGAT CAAGGTGTAT CCGGGCACTT CGCTGGTGTC GGGCGACCAG
ACCAAGGAAT TCACCGCGCT GCGCCAGGGC ATCATCGACA TGGCGGTCGG TTCCACGATC
AACTGGTCGC CGCAGGTCAA GGAGCTCAAC CTGTTCGCGC TGCCCTTCCT GATGCCCGAC
CACAAGGCGA TCGACGCGCT CACCCAGGGC CGCGTCGGCA AGAAGATGTT CGACATCCTC
GCCGAGCGCG ACGTGGTGCC GCTGGCCTGG GGCGAGAACG GTTTCCGCGA GGTCTCCAAC
TCGAAGAAGC CGATCCGCAC GCCCGAGGAC GTCAAGGGCA TGAAGATGCG CGTGGTCGGT
TCCCCGCTCT TCCTCGCCAC CTTCAACGCG CTCGGCGCCA ACCCGACGCA GATGAGCTGG
GCCGACGCCC AGCCGGCGAT GGCGACCGGC GCGGTCGACG GCCAGGAGAA CCCGCTCGCG
GTGTTCAACG CCGCCAAGCT GCACACCGTG GGGCAGAAGA ACCTGACCCT GTGGGGCTAC
GTCGCCGACC CGCTGATCTT CGTGGTGAAC AAGTCCGTGT GGAACTCGTG GTCCGAGGCC
GACCGCAAGG CCGTGTCCGA GGCCGCGCAG CAGGCTGCGA AGGAAGAGAT CGCGCGCGCG
CGCGCCGGCA TCTCGGCGGC CGACGACGCG CTGCTGAAGG AGATCGAGGC CAACGGCGTG
GCCGTGGTGC GCCTGACCGA TGCCGAGCGC GACGCCTTCC GCCAGGCCAC CGCGGGCGTG
TACAAGGAGT GGGCCGAGAA GATCGGCGCC GACCTCGTCA AGCAGGCCGA GGAAGACATC
GCCAAGCGCT GA
 
Protein sequence
MQKRRFTALI AGLFASTALG FSMPATAQQY KDEYKLSTVL GEAFPWGWGA KRWADLVAEK 
TEGRIKIKVY PGTSLVSGDQ TKEFTALRQG IIDMAVGSTI NWSPQVKELN LFALPFLMPD
HKAIDALTQG RVGKKMFDIL AERDVVPLAW GENGFREVSN SKKPIRTPED VKGMKMRVVG
SPLFLATFNA LGANPTQMSW ADAQPAMATG AVDGQENPLA VFNAAKLHTV GQKNLTLWGY
VADPLIFVVN KSVWNSWSEA DRKAVSEAAQ QAAKEEIARA RAGISAADDA LLKEIEANGV
AVVRLTDAER DAFRQATAGV YKEWAEKIGA DLVKQAEEDI AKR
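
The gene and protein sequences above can be related with a short translation sketch. It assumes the sequence is pasted in the 10-base blocks shown here (whitespace is stripped) and uses the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs in which codons may act as start codons, and that is not modeled. The helper names `clean`, `translate`, and `gc_content` are illustrative only.

```python
# Codon table built in the conventional TCAG order (standard assignments,
# which translation table 11 also uses for elongation codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last rows of the gene sequence in this record.
first_row = "ATGCAGAAAC GCCGCTTCAC CGCCCTCATC GCCGGCCTGT TCGCCTCGAC CGCGCTCGGT"
last_row = "GCCAAGCGCT GA"
print(translate(first_row))  # prints: MQKRRFTALIAGLFASTALG
print(translate(last_row))   # prints: AKR
```

The two printed fragments correspond to the first 20 and last 3 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives the fraction that the GC content field summarizes.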