Gene Tmz1t_3398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3398 
Symbol 
ID7873889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3716798 
End bp3717802 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content61% 
IMG OID643700337 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_002890369 
Protein GI237654055 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATCC GTTCGCTTCT GGTCGGTCTG GTTGCCGTGG GTCTGTCTGC TGCAGCGGTC 
GCCGCAGACC CCATCGTGAT CAAGTTCAGC CACGTGGTGG CGCAGGACAC GCCCAAGGGC
AAGGCTGCCG AAAAATTCAA GGAGCTCGCC GAGAAATACA CCGGCGGCGC GGTCAAGGTC
GAGGTGTACG CGAACAGTAC CCTCTACAAG GACAAGGAGG AGATGGAGGC GCTGCAACTC
GGTGCCGTGC ACCTGCTGGC GCCGTCTCTG GCCAAGTTCG GTCCGCTCGG TGTCAAGGAG
TTCGAGGTCT TCGATCTGCC CTACATCTTC GACGGCTACG AGGCGCTGAA CAAGGTCACC
CAAGGTGCGG TCGGCCAGCA GCTGCTCGCC AAGCTCGAGC CCAAGGGCAT CAAGGGCCTA
GCCTTCTGGG ACAACGGTTT CAAGTCGTTC TCGGCCAATA GCCCGATCAG GAAGCCGGAA
GACCTCAAGG GCAAGAAGAT GCGCATCCAG TCGTCCAAGG TGCTGGAAGA GCAGATGCGC
GAGATCAAGT CGCTGCCGCA GGTGATGGCC TTCTCCGAGG TCTACCAAGC GCTGCAGACC
GGCGTCGTCG ATGGGACCGA GAACCCGCAC TCCAACCTCT ACACCCAGAA GATGCACGAG
GTGCAGAAGC ACATGACCCT GACCGACCAT GGCTACCTGG GCTATGCGGT CATCACCAAC
AAGAAGTTCT GGGACGGCCT GCCGGCCGAG GTGCGCACGC AGCTCGACAA GGCGATGAAG
GAATCGACCG TCTACGCCAA CCAGATCGCC AAGGAAGAGA ACGACAAGTC GCTCGCGGCG
GTGCGTGCCT CCGGCAAGAC CGAGGTCTAT GCGCCGACCG CCGAAGAGAA AGCCGCGTTC
AAGAAGGCGC TCGTCCCGGT GCACAAGAAG ATGGAGTCGC GCATCGGCGC AGAGCTGATC
CAGTCGATCT ACAAGGAAAC CGGGTTCGAT CCGGCCAAGC TCTGA
 
Protein sequence
MKIRSLLVGL VAVGLSAAAV AADPIVIKFS HVVAQDTPKG KAAEKFKELA EKYTGGAVKV 
EVYANSTLYK DKEEMEALQL GAVHLLAPSL AKFGPLGVKE FEVFDLPYIF DGYEALNKVT
QGAVGQQLLA KLEPKGIKGL AFWDNGFKSF SANSPIRKPE DLKGKKMRIQ SSKVLEEQMR
EIKSLPQVMA FSEVYQALQT GVVDGTENPH SNLYTQKMHE VQKHMTLTDH GYLGYAVITN
KKFWDGLPAE VRTQLDKAMK ESTVYANQIA KEENDKSLAA VRASGKTEVY APTAEEKAAF
KKALVPVHKK MESRIGAELI QSIYKETGFD PAKL