Gene Tmz1t_3624 details


Gene Information

Locus tag: Tmz1t_3624
Symbol:
ID: 7873129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3981170
End bp: 3982885
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 73%
IMG OID: 643700564
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_002890594
Protein GI: 237654280
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
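
The coordinate and length fields in this record can be checked against each other. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3981170, 3982885)
print(length)                  # 1716
print(protein_length(length))  # 571
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields listed above.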


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACCC AGCTCGCCCG CATCGCCCGC TCCCTCTGCC TGGCCTTCGC GCTCGGCGCC 
GGCGGGGTCG CGACGGCGGC CAGCGGCGAG GCCGGCGAGG GCAACCTGCC CGCGCAGGAA
CTCACTCCGC GCACGCTGTA CCACTTCCTG CTCGCCGAGA TCGCCGGCGC GCGCGGCCAG
ATCGGCCTGT CGGCACAGCT CTATCTCGAC CTCGCGCGCA GCACGCGCGA CCCGCGCATC
GCCCGCCGCG CCACCGAGAT CGCGATGTAC TCGCGCAACC TCGTGATGGC GCGCGGCGCC
GCGGAGATCT GGACCGAGGT CGCCCCCGAC TCGGACGAGG CGCGCCGGGT ACTCGCCGGG
CTGAGCGGCG CCGGCCGCGG CGAGGACATC AACCTCGAGG CCATCCAGTT CCAGCTCGCG
CGCGTGCTCG CGCAATCGAA CGGCCGCCTC GCGCAGAACC TGCTGAGCCT CGGCCATACC
CTCGCCCGCG TACCCGACAA ACAGGCGGTG CGCGGCATCG TGATGCGCCT GACCGAACCC
TACGTCGACA TGCCCGAGGC GCACATCGCC CGCGCGCAGG CGGCACAGGC GGTCGAGGAC
AGGATGGGCG CGCTCGCTGC GGTGGATCGC GCCCTCGAGC TGCGCAGCGG CTGGGAACCC
GCGGTGCTGC TGAAGGTGCA GATCCTCCAG CAGGCAGGCG CGCACACGGA GGCCCTGCGC
GTGCTCGAGG CCGAGGCCGC GCGCGCGCCG GCGAGCCGGT CGCTGCGCCT GGCGAAGGCG
CGTGCGCTGG TGAGCGCGCA GCGCTTCGGC GAGGCACGCG CGGCCTTCAA CCAGTTGCTC
GAAGCCTCGC CGCAGGATCC CGAACTGCTC TATGCGGTGG GCCTGCTGTC GATGCAGCTC
GAGGACTTCG CCGCGGCCGA GCTGCACTTC GCACGCGCGC TCGCCGCGGA GCACCCGCAA
CCCGACCTCA TCCGCCTTCA CCTGGGCCAG ATCGCCGCCG ACCGCGGCGA GGGCGAGCGC
GCACGCAAGT GGTTCGGCGA GATCGAGAGC GAGGACCTCC GTCCCGAGGC GGACATCCGC
AGCGCCCTGA GCCTCGCGCA CGAAGGACGC ATCGAGGAGG CGCGCGCCCT GTTACGCAAC
GACGTCGAGG ATCCCGACCT CGCCCGCCGC TACCTGCTCG CCGAGGCGCA GATCCTGCGC
GACGCCGAGC GTCCGACCGA GGCCCTCGCG CTGCTCGATG CCGCCCTGCG CGAGAACCCC
GAAGACACCG GCCTGCTCTA CGAGGCCGCC ATGCTCGCCG AGCGCATCGG TCGCATGGAC
CTGCTCGAAG CGCGCCTGCG CCGCGTGCTC GAGCTGCAGC CCGATCATGC GCATGCGCTC
AACGCGCTCG GTTATTCGCT CGCCGACCGC GGCCTGCGCC TGGACGAAGC CGAGGCGCTG
ATCGCACGCG CGCATGCGCT CATGCCGCAA GACCCCTTCA TTCTCGACAG CCTCGGCTGG
GTGCGCTTCC GCCGCGGCGA TCAGGTCGGC GCGCTCGTCC ACCTGGAGCG CGCCTATGGC
ATGCGCAAGG ACGCCGAGAT CGCCGCCCAC CTCGGCGAGG TGCTATGGAC ACTCGGCCGT
CGGGACGAGG CCCGACGGAT CTTCGCCGAG GCGCTCGCAG CCCACCCCGA CAACCGTCTG
CTGACGGACA CCGGCCGCCG ACTGGGCATC CAGTGA
 
Protein sequence
MKTQLARIAR SLCLAFALGA GGVATAASGE AGEGNLPAQE LTPRTLYHFL LAEIAGARGQ 
IGLSAQLYLD LARSTRDPRI ARRATEIAMY SRNLVMARGA AEIWTEVAPD SDEARRVLAG
LSGAGRGEDI NLEAIQFQLA RVLAQSNGRL AQNLLSLGHT LARVPDKQAV RGIVMRLTEP
YVDMPEAHIA RAQAAQAVED RMGALAAVDR ALELRSGWEP AVLLKVQILQ QAGAHTEALR
VLEAEAARAP ASRSLRLAKA RALVSAQRFG EARAAFNQLL EASPQDPELL YAVGLLSMQL
EDFAAAELHF ARALAAEHPQ PDLIRLHLGQ IAADRGEGER ARKWFGEIES EDLRPEADIR
SALSLAHEGR IEEARALLRN DVEDPDLARR YLLAEAQILR DAERPTEALA LLDAALRENP
EDTGLLYEAA MLAERIGRMD LLEARLRRVL ELQPDHAHAL NALGYSLADR GLRLDEAEAL
IARAHALMPQ DPFILDSLGW VRFRRGDQVG ALVHLERAYG MRKDAEIAAH LGEVLWTLGR
RDEARRIFAE ALAAHPDNRL LTDTGRRLGI Q
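
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any counting. The following is a minimal sketch of how that might be done and how a GC percentage could be computed from the result. The helper names are hypothetical, and the exact method this database used to derive its GC content field is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listed above (six blocks of ten bases).
first_line = "ATGAAAACCC AGCTCGCCCG CATCGCCCGC TCCCTCTGCC TGGCCTTCGC GCTCGGCGCC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field, and `len(clean_sequence(...))` applied to the gene and protein sequences can be compared with the Gene Length and Protein Length fields.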