Gene Tmz1t_1039 details

Gene Information

Locus tag: Tmz1t_1039
Symbol:
ID: 7084023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1141120
End bp: 1142730
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 65%
IMG OID: 643698057
Product: cytochrome c oxidase, subunit I
Protein accession: YP_002354697
Protein GI: 217969463
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID: [TIGR02891] cytochrome c oxidase, subunit I
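
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1141120
end_bp = 1142730
reported_gene_length = 1611    # bp
reported_protein_length = 536  # aa

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced, complete CDS whose last codon is a stop codon,
# so the protein is one residue shorter than the codon count.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions both reported lengths agree with the coordinates.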


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCC AGGTCACCGG CACCCTCGAT GCACACGGCG CTGCAGGGCG TCCGCGCGAC 
CACGACCACA AGCCGCGCGG CCTCGCGCGC TGGCTGTTCA GCACCAACCA CAAGGACATC
GGCACGCTCT ACCTGCTGTT CTCGCTCGCC ATGCTGTTTA CCGGCGGCAG CCTGGCGATG
GTGATCCGCG CCGAGCTCTT CCAGCCCGGG CTGCAGTTCG TCGATCCGCA CTTCTTCAAC
CAGATGACGA CGGTGCACGG CCTGGTGATG GTGTTCGGCG CGGTGATGCC GGCCTTCGTC
GGCCTGGCGA ACTGGATGAT CCCGCTGATG ATCGGCGCTC CCGACATGGC GCTGCCGCGG
ATCAACAACT GGAGCTTCTG GATCCTGCCG TGCGCGTTCG CGATCCTGCT GTCCACGCTG
TTCATGGAAG GCGGCGCGCC GGCCGCCGGA TGGACCTTCT ACGCGCCGCT GTCGACCAAG
TACAGCGGAG ATTCGACCGC CTTCTTCGTG CTCGCGGTGC ACCTGATGGG GGTGTCCTCG
ATCATGGGGG CGATCAACGT CATCGTCACC ATCTGGAACA TGCGCGCGCC GGGCATGGGC
TGGATGAAGC TGCCGCTGTT CGTGTGGACC TGGCTCATCA CCGCCTTCCT GCTGATCGCG
GTGATGCCGG TGCTCGCCGG CGTCGTGACC ATGGTGCTCA CCGACAAGTA CTTCGGCACC
AGCTTCTTCG ACGCCGCGGG CGGCGGCGAC CCGGTGATGT TCCAGCACAT CTTCTGGTTC
TTCGGCCATC CGGAGGTCTA CATCATGATC CTCCCCGCCT TCGGCATCGT CTCCACCATC
ATCCCCACCT TCGCGCGCAA GCCGCTGTTC GGCTACGAGT CGATGGTGAT CGCCACCGCC
AGCATCGCCT TCCTGTCCTT CATCGTCTGG GGCCACCACA TGTTCACCAC CGGCATGCCG
GTGGTCGCCG AGCTGTTCTT CATGTACGCC ACCATGCTGA TCGCCGTGCC CACCGGCGTG
AAGGTGTTCA ACTGGGTGGC GACGATGTGG CGCGGCTCGA TGACTTTCGA GGTGCCGATG
ATGTTCTCGC TCGCCTTCAT CGTGCTGTTC ACCATCGGCG GCTTCTCCGG GCTGATGCTG
GCCATCATCC CCGCCGACTT CCAGTACCAG GACACCTACT TCGTCGTCGC CCACTTCCAC
TACGTGCTGG TGACCGGCGC GGTGTTCGGC ATCATCGCCG CGGTGTACTA CTGGATCCCG
AAGTGGACCG GCGTGATGTA CAACGAGCGC CTCGCCCAGG TGCACTTCTG GTGCTCGCTG
GTGTCGGTGA ACATGCTGTT CTTCCCGATG CACTTCGTCG GCCTCGCCGG CATGCCGCGG
CGCATTCCCG ACTACGCGCT GCAGTTCGCC GACCTCAACG CCTTCATGAG CATCGGCGGC
TTCCTGTTCG GCCTGTCGCA GCTGCTCTTC CTGTGGGGCG TGGTGCGCTG CATGCGCGGC
ATCGGCGACA AGGCCACCGA TCGCGTGTGG GAGGGCGCAC AGGGGCTGGA GTGGGAGGTG
CCGTCGCCCG CGCCTTACCA CACCTTCGAC ACTCCGCCGG TGGTCAAGTG A
 
Protein sequence
MSTQVTGTLD AHGAAGRPRD HDHKPRGLAR WLFSTNHKDI GTLYLLFSLA MLFTGGSLAM 
VIRAELFQPG LQFVDPHFFN QMTTVHGLVM VFGAVMPAFV GLANWMIPLM IGAPDMALPR
INNWSFWILP CAFAILLSTL FMEGGAPAAG WTFYAPLSTK YSGDSTAFFV LAVHLMGVSS
IMGAINVIVT IWNMRAPGMG WMKLPLFVWT WLITAFLLIA VMPVLAGVVT MVLTDKYFGT
SFFDAAGGGD PVMFQHIFWF FGHPEVYIMI LPAFGIVSTI IPTFARKPLF GYESMVIATA
SIAFLSFIVW GHHMFTTGMP VVAELFFMYA TMLIAVPTGV KVFNWVATMW RGSMTFEVPM
MFSLAFIVLF TIGGFSGLML AIIPADFQYQ DTYFVVAHFH YVLVTGAVFG IIAAVYYWIP
KWTGVMYNER LAQVHFWCSL VSVNMLFFPM HFVGLAGMPR RIPDYALQFA DLNAFMSIGG
FLFGLSQLLF LWGVVRCMRG IGDKATDRVW EGAQGLEWEV PSPAPYHTFD TPPVVK
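
The protein sequence is expected to be the translation of the gene sequence under translation table 11. The sketch below checks this on the first and last lines of the gene sequence only, as an illustration rather than a full validation. Table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch ignores (the gene starts with ATG). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon, trailing bases are dropped."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from above.
# Both begin at a codon boundary (offsets 0 and 1560).
first_gene_line = "ATGAGCACCC AGGTCACCGG CACCCTCGAT GCACACGGCG CTGCAGGGCG TCCGCGCGAC"
last_gene_line = "CCGTCGCCCG CGCCTTACCA CACCTTCGAC ACTCCGCCGG TGGTCAAGTG A"

head = translate(first_gene_line)
tail = translate(last_gene_line)

print(head)  # MSTQVTGTLDAHGAAGRPRD
print(tail)  # PSPAPYHTFDTPPVVK*

# Compare with the start and end of the listed protein sequence.
print(head == clean("MSTQVTGTLD AHGAAGRPRD"))        # True
print(tail.rstrip("*") == clean("PSPAPYHTFD TPPVVK"))  # True
```

Running the same function over the whole gene sequence would extend the check to all 536 residues.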