Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1039 |
Symbol | |
ID | 7084023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1141120 |
End bp | 1142730 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643698057 |
Product | cytochrome c oxidase, subunit I |
Protein accession | YP_002354697 |
Protein GI | 217969463 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 |
TIGRFAM ID | [TIGR02891] cytochrome c oxidase, subunit I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCC AGGTCACCGG CACCCTCGAT GCACACGGCG CTGCAGGGCG TCCGCGCGAC CACGACCACA AGCCGCGCGG CCTCGCGCGC TGGCTGTTCA GCACCAACCA CAAGGACATC GGCACGCTCT ACCTGCTGTT CTCGCTCGCC ATGCTGTTTA CCGGCGGCAG CCTGGCGATG GTGATCCGCG CCGAGCTCTT CCAGCCCGGG CTGCAGTTCG TCGATCCGCA CTTCTTCAAC CAGATGACGA CGGTGCACGG CCTGGTGATG GTGTTCGGCG CGGTGATGCC GGCCTTCGTC GGCCTGGCGA ACTGGATGAT CCCGCTGATG ATCGGCGCTC CCGACATGGC GCTGCCGCGG ATCAACAACT GGAGCTTCTG GATCCTGCCG TGCGCGTTCG CGATCCTGCT GTCCACGCTG TTCATGGAAG GCGGCGCGCC GGCCGCCGGA TGGACCTTCT ACGCGCCGCT GTCGACCAAG TACAGCGGAG ATTCGACCGC CTTCTTCGTG CTCGCGGTGC ACCTGATGGG GGTGTCCTCG ATCATGGGGG CGATCAACGT CATCGTCACC ATCTGGAACA TGCGCGCGCC GGGCATGGGC TGGATGAAGC TGCCGCTGTT CGTGTGGACC TGGCTCATCA CCGCCTTCCT GCTGATCGCG GTGATGCCGG TGCTCGCCGG CGTCGTGACC ATGGTGCTCA CCGACAAGTA CTTCGGCACC AGCTTCTTCG ACGCCGCGGG CGGCGGCGAC CCGGTGATGT TCCAGCACAT CTTCTGGTTC TTCGGCCATC CGGAGGTCTA CATCATGATC CTCCCCGCCT TCGGCATCGT CTCCACCATC ATCCCCACCT TCGCGCGCAA GCCGCTGTTC GGCTACGAGT CGATGGTGAT CGCCACCGCC AGCATCGCCT TCCTGTCCTT CATCGTCTGG GGCCACCACA TGTTCACCAC CGGCATGCCG GTGGTCGCCG AGCTGTTCTT CATGTACGCC ACCATGCTGA TCGCCGTGCC CACCGGCGTG AAGGTGTTCA ACTGGGTGGC GACGATGTGG CGCGGCTCGA TGACTTTCGA GGTGCCGATG ATGTTCTCGC TCGCCTTCAT CGTGCTGTTC ACCATCGGCG GCTTCTCCGG GCTGATGCTG GCCATCATCC CCGCCGACTT CCAGTACCAG GACACCTACT TCGTCGTCGC CCACTTCCAC TACGTGCTGG TGACCGGCGC GGTGTTCGGC ATCATCGCCG CGGTGTACTA CTGGATCCCG AAGTGGACCG GCGTGATGTA CAACGAGCGC CTCGCCCAGG TGCACTTCTG GTGCTCGCTG GTGTCGGTGA ACATGCTGTT CTTCCCGATG CACTTCGTCG GCCTCGCCGG CATGCCGCGG CGCATTCCCG ACTACGCGCT GCAGTTCGCC GACCTCAACG CCTTCATGAG CATCGGCGGC TTCCTGTTCG GCCTGTCGCA GCTGCTCTTC CTGTGGGGCG TGGTGCGCTG CATGCGCGGC ATCGGCGACA AGGCCACCGA TCGCGTGTGG GAGGGCGCAC AGGGGCTGGA GTGGGAGGTG CCGTCGCCCG CGCCTTACCA CACCTTCGAC ACTCCGCCGG TGGTCAAGTG A
|
Protein sequence | MSTQVTGTLD AHGAAGRPRD HDHKPRGLAR WLFSTNHKDI GTLYLLFSLA MLFTGGSLAM VIRAELFQPG LQFVDPHFFN QMTTVHGLVM VFGAVMPAFV GLANWMIPLM IGAPDMALPR INNWSFWILP CAFAILLSTL FMEGGAPAAG WTFYAPLSTK YSGDSTAFFV LAVHLMGVSS IMGAINVIVT IWNMRAPGMG WMKLPLFVWT WLITAFLLIA VMPVLAGVVT MVLTDKYFGT SFFDAAGGGD PVMFQHIFWF FGHPEVYIMI LPAFGIVSTI IPTFARKPLF GYESMVIATA SIAFLSFIVW GHHMFTTGMP VVAELFFMYA TMLIAVPTGV KVFNWVATMW RGSMTFEVPM MFSLAFIVLF TIGGFSGLML AIIPADFQYQ DTYFVVAHFH YVLVTGAVFG IIAAVYYWIP KWTGVMYNER LAQVHFWCSL VSVNMLFFPM HFVGLAGMPR RIPDYALQFA DLNAFMSIGG FLFGLSQLLF LWGVVRCMRG IGDKATDRVW EGAQGLEWEV PSPAPYHTFD TPPVVK
|
| |