Gene Tmz1t_2356 details


Gene Information

Locus tag: Tmz1t_2356
Symbol:
ID: 7094278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011667
Strand:
Start bp: 20799
End bp: 21878
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 72%
IMG OID: 643701044
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_002364185
Protein GI: 217980135
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
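
The coordinates and lengths in this record can be cross-checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(20799, 21878)
print(length)                     # 1080
print(protein_length_aa(length))  # 359
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.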


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value: 0.337618
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.0354802
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCC GCCAAGCCCC AGAAATCTTC CAGATCCCGC TCGACGACCT CGACCCGGCG 
GGCCTGCCGC CGCGCGGCAC GCCCGAGTTC GAGCAGGCGG TGATCGGGCG CTACGCGCTC
GACTACGCGG CGCGGGGCTG GCAGGCCGTG GTGGCGGTCG ATGAGGGTTT CGTGCGCGTG
GTCGCGGTCC CCGAGCGCGG GGTCGAGCCG AAGGCCTACG TGCTGGGGCT GCTGCAAAAC
GGCTTCCTGG AGGATGCGCT GCCGGTGCTC GAGGCGCTCG ACGGCATGCT GGACGACGCC
GAGATCGCCT ACAGCCACGG GCTGTGCCTG TCCGAACTGA GGCGACCGGC AGAGGCGGTC
GCCCCGCTGC AGCGGGCGGT CGAACTCGAC CCCACGCACG CGAACGCGTT CATCGCGCTC
GGGGTAGCGT TCGCGCGCAC CGGGCGCGCC GACGAGGCCG CCGACGCGCT GCGCGACGCG
GTCAAGCTCG AGCCGGAGAA CGCCTTCGCC AAGCGCAACC TGGCGGCGGT GCTGATGCGT
TCCGGCCGGA CGGCCGAGGC GCTGCCGTTC TTCCGCCAGG CGGCGAGCCT CGCGCCGGCG
GATCCGGGGG CGCAGCTCGG GCTCGCGCAG TGCCTGGAAG AGCTCGGGCC CTCGCACGTG
AAGGAGGCGG CCGAGCAGTA CAAGGCGGTG GTCAAGCGTT TTCCCGAGCA CCAGGCGGGC
GAGATGGCCG AAGAGGCGCT CACGCGCATC GGGCAGGACG AGCTGCGCGC GGCGGTCGAC
GGCGGGCTGC GCATGGACGC GGTGATGTAC ATGCAGGCGG CGCTGGACCG CTTCGCCAAG
CTCGACCAGG CGAAGGTCGG GCAGATCGTG ATGGAGATCG CGCTGCTCGG CCGCAACGGG
CTCGAGATCA ACAAGCCGGC CGTGCGCTAC ACGCTCGAGA ACCTCGAGGG CGAGTTCTCC
GGGCTGGCCC TGCTGGCGTA CATGCACGTG GGGTTCCGGA TGTTCGACGC CAAGGGCGAC
GCCGGAACCG GGCTCGATCG CGAATACGAG GCGGCGGTGA AGATGCGCCG CGAGCGCTGA
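
The gene sequence is displayed in blocks of 10 bases, six blocks per line, so the whitespace has to be removed before the sequence is measured. The following is a minimal sketch of how the displayed text could be cleaned and its GC fraction computed, for comparison with the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as input here.

```python
def clean_sequence(display_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(display_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First displayed line of the gene sequence; paste all lines to analyse the full gene.
first_line = "ATGACCACCC GCCAAGCCCC AGAAATCTTC CAGATCCCGC TCGACGACCT CGACCCGGCG"
sequence = clean_sequence(first_line)
print(len(sequence))
print(f"{gc_fraction(sequence):.0%}")
```

Passing all displayed lines to `clean_sequence` as one string gives the full gene, whose length and GC fraction can then be compared with the values listed in the record.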
 
Protein sequence
MTTRQAPEIF QIPLDDLDPA GLPPRGTPEF EQAVIGRYAL DYAARGWQAV VAVDEGFVRV 
VAVPERGVEP KAYVLGLLQN GFLEDALPVL EALDGMLDDA EIAYSHGLCL SELRRPAEAV
APLQRAVELD PTHANAFIAL GVAFARTGRA DEAADALRDA VKLEPENAFA KRNLAAVLMR
SGRTAEALPF FRQAASLAPA DPGAQLGLAQ CLEELGPSHV KEAAEQYKAV VKRFPEHQAG
EMAEEALTRI GQDELRAAVD GGLRMDAVMY MQAALDRFAK LDQAKVGQIV MEIALLGRNG
LEINKPAVRY TLENLEGEFS GLALLAYMHV GFRMFDAKGD AGTGLDREYE AAVKMRRER
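
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the standard NCBI ordering of bases (T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which this sketch does not model. The `translate` function is a hypothetical helper that stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three displayed blocks of the gene sequence.
print(translate("ATGACCACCC GCCAAGCCCC AGAAATCTTC"))  # MTTRQAPEIF
```

The output matches the first block of the protein sequence shown above. A maintained library such as Biopython offers an equivalent translation routine and may be preferable for production use.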