Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2613 |
Symbol | |
ID | 7873354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2818502 |
End bp | 2819710 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643699536 |
Product | cytochrome d1 heme region |
Protein accession | YP_002889592 |
Protein GI | 237653278 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.683986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTGCC GTGAAGTTCG TCTCGCACCG CCCGCCCTAT GGGGCGTGGT CCTCTCCGCC ATGGTCGTGC TGCTGTCGGC CTGTTCGAGC GTGCCGCCGG TCGCCGTGCG CGGCACCGGC GACCTCGGCG TGGTGATCGA GCGTGCCGAC GGCAAGGTCA AGGTGATCGA GACCACTGGC CGCAGCGTGC TGGCCACCGT GGATGGCCTG GGCGATCTCT CCCACGCCTC GGTGGTTTTC TCGCGCGATG GCCGCTACGC CTTCGTGTTC GGGCGCGACG GCGGGCTGAC CAAGGTCGAT CTGCTGGCGC GCCGGATCGT CGGCCGCAAG GTCCAGGCGG GCAATGCGAT CGGCGGCTCG ATCTCGCACG ACGGCAGCCT GGTGGTGGTG CAGAACTACG AACCGGGCGG CATCAAGGCC TTCGACGCCA ACACGCTCGA GCTCGTCGCC GATGTGCCTG CGACCACGGA CGACGGCACC CGCTCCAAGG TGGTGGGCCT CGCCGACCTC TCGGGCAAGC GCTTCATCTA TTCGCTGTTC GAGGCGGGCG AGATTCGCAT CACCGACTTC TCCGACCCGA AGAATCCGCT GACCCGGCGT TTTGCCGGCG GCAAGCAGCC CTACGATGCA CTGGTCACGC CCGATGGCCG GTACTACATC GCCGGCCTGT TCGGCGAGGA CGGCCTTGCC CTGATCGACC TGTGGAACCT CGACAAGGGC AGCCGCAAGA TCCTCTCCGG CTACGGTCGT GGCGAGCAGC CCCTGCCGGT GTTCAAGATG CCCCACCTGC GCGGCTGGTC GATCGCCGGC AATCGGGCCT ACCTGCCCGC CATCGGTCGC CACGAGGTGC TGGTGGTCGA CACCGCCACC TGGCAGGAGG TCGATCGCAT CCAGGTGAAG AGCCAGCCGG TGTTCGCCAT GGCGCGCCCC GACGGGCGCG AGATCTGGGT GAACTTCGCC TTCCCCGACA ACGGCTGGGT GCAGGTCATC GACACCGTCT CCGGCAAGGT CACCGACACC CTGCAGCCGG GCCGCGGCAT CCTGCACATG GAGTTCCTCT CCAAGGGCCA CGAGATCTGG CTGTCGGCGC GCGACGACAA CAAGGTCGTG ATCTACGACA CCGCCAGCAA GCAGCCGATC GGCGGCTTCG AGTCGGCGAG CCCGAGCGGC ATCTTCTTCA CCACGCGCGC CGCGCGCACG GGCTTCTGA
|
Protein sequence | MSCREVRLAP PALWGVVLSA MVVLLSACSS VPPVAVRGTG DLGVVIERAD GKVKVIETTG RSVLATVDGL GDLSHASVVF SRDGRYAFVF GRDGGLTKVD LLARRIVGRK VQAGNAIGGS ISHDGSLVVV QNYEPGGIKA FDANTLELVA DVPATTDDGT RSKVVGLADL SGKRFIYSLF EAGEIRITDF SDPKNPLTRR FAGGKQPYDA LVTPDGRYYI AGLFGEDGLA LIDLWNLDKG SRKILSGYGR GEQPLPVFKM PHLRGWSIAG NRAYLPAIGR HEVLVVDTAT WQEVDRIQVK SQPVFAMARP DGREIWVNFA FPDNGWVQVI DTVSGKVTDT LQPGRGILHM EFLSKGHEIW LSARDDNKVV IYDTASKQPI GGFESASPSG IFFTTRAART GF
|
| |