Gene Tmz1t_3872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3872 
Symbol 
ID7873523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4268323 
End bp4270551 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content69% 
IMG OID643700814 
Productcytochrome C family protein 
Protein accessionYP_002890837 
Protein GI237654523 
COG category 
COG ID 
TIGRFAM ID[TIGR01905] doubled CXXCH domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.817906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCAAC TCGCCGTCTT CATTCTTGCC CTGCTGTTGG CCCCCTGGTC GGTGGCTGGC 
TCCGGGTATG TCGGCAGTAC CCGCTGCGTC GCCTGCCATC AGGCGGAGGC CGAGGCCTGG
CGCGGCTCGC ATCACGAGTT GGCGATGGCC GTGGCGGCAA CCGACAAGGT CCTGGGCGAC
TTCAACGACG CCACCTTCAC CGCGCATGGC GTGACCTCGC GCTTCTATCG CAAGGACGGC
AGCTATTTCG TGCGCACCGA TGGCCCGGAT GGCAAGTTGC AGGACTACCG GATCAAATAC
ACCTTCGGCT GGACGCCGTT GCAGCAGTAC CTGATCGAGT TGCCCAATGG CCATGTCCAG
GCGCTGGGCA TCGCCTGGGA TAGCCGGCCG GCGGCCGTCG GCGGCCAGCG CTGGTTCCAT
CTTTACCCCA ATGAGCCGAT GGATCACCGC CATCCGCAGC ACTGGACGGC GCGCAGCCAG
ACCTGGAACC ATCAATGTGC CGAGTGTCAC TCGACCAATC TGCAGAAGAA CTACGACCTG
GCGGCCGATC GTTACCGGAC GACCTGGAGC GAGATCAACG TCGCCTGCGA GGCCTGTCAC
GGTGCCGGCG GCAAGCATGC CGACTGGGCC GCACTGCCGG CGGCAAGGCG TCCGGCGGGC
GACAAGGGAC TGACGGTTTC GCTGGCTGCC GCGGCCACCA CCACCTGGGC CTTCGATCCG
GCCAGCGCTG CAGCGCAGGT CGAGGCCTGT GCCCGCTGCC ATTCGCGGCG CGGGCCGATC
TGGTCCGATG ACGGCGGTGG CCGCCCGCTG GGCAACAGCC ATCGCCTGGC CCTGCTCGAG
GAGCGGCTGT ACTTTGCCGA CGGCCAGATC AAGGACGAGG TCTTCGAGTA CGCCTCCTAC
ACGCAGAGCC GGATGCATGC GGCCGGCGTC GCCTGCACCA ACTGCCACGA GCCGCACAGC
CTGAAGCTGC GGGCCGAGGG CAATGCCCTG TGCGCCAGCT GCCATCCGGC GGCACGCTAC
GACACCCCGG CGCACCACCA TCACCCGGCG GGCAGCCCGG GGGCCAGTTG CACCAGCTGC
CATATGCCGC AGCGGGCGTA CATGGTCCAT GACTGGCGCG CCGACCACAG CATCCGTGTG
CCGCGTCCCG ATCTCTCCGT CAGGCTGGGC ACGCCGAACG CCTGTGCCGG ATGCCATGCG
CAGCAGGGCC ACGAATGGGC GGCCCGTGCC CTTTCCCGGT GGTATCCCGA GAGCCGGATG
CGCGGGCCGC ATTTTGCCGA AGCCTTCCAT GCGGCCGCCA CCGGTGCGGC CGACGGGGCT
GCCCGCTTGC TGGCGGTGGC GAGCGATCCG CAGCAGCCGG CGATTGTCCG CGGCAGCGCC
GCCAGCCGCC TGGCCGGCCT GGGGGCGGTG CCGCCGACGC CCGAGCTGCA GGCCTTGCTG
GCGGATCGGC AGCCGCTGGT GCGCGCCGCC TCGTTGCGCT TCCTCGAGGT GGCGGATGCG
CGCACCCGCT TCGAGCAGGG CTGGAGCAGC CTGCGCGACA GCGAGCGGAC GGTACGCCTC
GAGGCTGTCC GGGTTCTTGC CCCGCTGCTG CGCGAGCGAC TGCCGGCGGC CCAGCGGGAG
GAACTGCTGC GCGGCGTGGC CGAGTACGAA GCTTCGCTTC AGGTCAACGC CGATCTGCCC
GAGAGTCATG TCAGCCAGGG GCTGCTCGCC CTGTCGATGG GCGACGGCGA GCAGGCGGAA
CAGGCCTACC GGACGGCACT GCGGCTGGAT GCTCGTTTCG TCCCGGCCTA TGTCAACCTG
GCCGACCTTT ACCGCCTGCA GCAGCGCGAA GGCGAGGGCG AGCACCTGCT GCGCGAGGGC
ATCGACAGGA TCACCTTCGA TGCCGACCTG CGCCATACCC TCGGCCTCAA TCTGATCCGC
CAGCAACGCC GCGGCGAAGC CCTGCAGTGG CTGCGCGAGG CTGCCGAAGC GGAAAGCGCC
AATGCCCGCT ACAGCTATGT CTATGCCCTG GCCCTGCAGG GCAGTGGCGA CGGGGTAGGC
GCCCTGCGCA TCCTGCGTCA GGCGCAATCG CAGCATCCGG GCAATCGCGA TGTCCTTTTC
GCCCTGGCGA CGATCAGTCG CGACCAGAAG GACATGGTCA GGGCGCGCGC CTATGCCGAG
GAATTGCTCG AGCGCTTCCC GGGGGACCGG CAAGCCAAGG CTTTGTGCGA GACCTTGCGG
GAGCGATGA
 
Protein sequence
MRQLAVFILA LLLAPWSVAG SGYVGSTRCV ACHQAEAEAW RGSHHELAMA VAATDKVLGD 
FNDATFTAHG VTSRFYRKDG SYFVRTDGPD GKLQDYRIKY TFGWTPLQQY LIELPNGHVQ
ALGIAWDSRP AAVGGQRWFH LYPNEPMDHR HPQHWTARSQ TWNHQCAECH STNLQKNYDL
AADRYRTTWS EINVACEACH GAGGKHADWA ALPAARRPAG DKGLTVSLAA AATTTWAFDP
ASAAAQVEAC ARCHSRRGPI WSDDGGGRPL GNSHRLALLE ERLYFADGQI KDEVFEYASY
TQSRMHAAGV ACTNCHEPHS LKLRAEGNAL CASCHPAARY DTPAHHHHPA GSPGASCTSC
HMPQRAYMVH DWRADHSIRV PRPDLSVRLG TPNACAGCHA QQGHEWAARA LSRWYPESRM
RGPHFAEAFH AAATGAADGA ARLLAVASDP QQPAIVRGSA ASRLAGLGAV PPTPELQALL
ADRQPLVRAA SLRFLEVADA RTRFEQGWSS LRDSERTVRL EAVRVLAPLL RERLPAAQRE
ELLRGVAEYE ASLQVNADLP ESHVSQGLLA LSMGDGEQAE QAYRTALRLD ARFVPAYVNL
ADLYRLQQRE GEGEHLLREG IDRITFDADL RHTLGLNLIR QQRRGEALQW LREAAEAESA
NARYSYVYAL ALQGSGDGVG ALRILRQAQS QHPGNRDVLF ALATISRDQK DMVRARAYAE
ELLERFPGDR QAKALCETLR ER