Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3142 |
Symbol | |
ID | 7874284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3398058 |
End bp | 3399578 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700072 |
Product | cytochrome c family protein |
Protein accession | YP_002890116 |
Protein GI | 237653802 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.144797 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTGCGG CCGGCGGCAA GGCGGCGAGC ACGGCGGACC ACAGCAGGTT CAAGGAACTG CAGGGCCCGT TCGAGAGCGG GCCGGAGGTC ACCAAGGCCT GTCTCGCCTG TCACAACGAG GCCGGGCACC AGGTGATGAA GAGCGTGCAC TGGACCTGGG AGGCGATCAG CCCGACCACC GGCAAGAAGC TGGGCAAACA GCACGCGGCC AACAACTTCT GCGGCTCCAT CCTTTCCAAC GAGCCGCGCT GCACGAGCTG CCACGCCGGC TACGGGTGGA CGGACAAGAA CTACGATTTC ACCAACCAGA ACAACGTGGA TTGCCTGGCC TGCCACGACA CGACGACGAC CTACAAGAAG ATCGCCACCG ATGCCGGCCA TCCGCTCTAC GCGCCGCGCG AGCTGCCCAA GGGCAGCGGC AGGATCGTGC AGCCGCCCGA CCTGGCCAGG ATCGCGCAGG CGGTGGGCAA GCCGGGGCGG CACAACTGCG GCGCGTGCCA CTTCAACGGC GGTGGCGGCG ACGCGGTCAA GCATGGCGAC CTCGACTCCT CGCTGGTCAA GCCGCCGAAA CACGTCGATG TGCACATGTC GCCCGATGGC GAGAACATGA GCTGTGCGGA CTGCCACACC TTCAACGCCC ACCAGCCCTC CGGCAGCCGC TACGCGGCTA CCGCCAAGGA CACCAAGGGC CTGGACATGA AGCATGAGGA CCACAAGCGC GCGACCTGCG AGTCCTGCCA CGACCTTCGC CCGCACAAGA ACGAGCGCCT CGACAACCAC GCCAGGCGAG TGGCTTGCCA GACTTGCCAC ATCCCCGAGT TCGCGCGCGG CGGCATCGCC ACCAAGGTGC TGTGGGACTG GTCGACCGGC GGCAAGCGCG GTCCGGACGG CAAGCCGCTG TTCATCCAGG ACGACCACGG CCACCTGATC TACTCGGCGG AAAAGGGCGA CTTCAAGTAC GGCGAGAACG TGCGCCCGAC CTACAAGTGG TACAACGGCG TGGTCCATCA GATCGCGCTC ACCGACAAGA TCGACGACAG CAAGATCCTC GAGCTGAACC GCGTCGAAGG CTCGGCGAGC GACCCGAACG CGCGCATCTG GCCCTTCAAG GTGATGGTCG GCAAGCAGCC CTACGACCCG GTGAACAAGA CTCTGGTGGT GAACCACGTA TACGGCCAGG ACGACACCGC GTTCTGGGGA AACTTCGACT ACGCCAAGTC GATCAAGGCC GGCATGGACT ACGCCGGCCT GCCCTACAGC GGCCAGTTCG GCTTCATCGA GACGCGCATG AACTGGTTCA TCACCCACAT GGTGGCGCCC AAGGAAGACG CGCTCAAGTG CAGGGACTGC CACACCCGCT CGGAAGACGG CCGCCTCGCC GGGATCACCG ACCTCTACCT GCCCGGTCGC GACCGCAACG GCTGGGTCGA TCTCCTCGGT GGGCTGGCGA TCATCGGCGC GATCGGCGCC GGCTCGATCC ACGGCATCGC CCGCATCCTC ACCCGTCGCA AGGGGCACTG A
|
Protein sequence | MAAAGGKAAS TADHSRFKEL QGPFESGPEV TKACLACHNE AGHQVMKSVH WTWEAISPTT GKKLGKQHAA NNFCGSILSN EPRCTSCHAG YGWTDKNYDF TNQNNVDCLA CHDTTTTYKK IATDAGHPLY APRELPKGSG RIVQPPDLAR IAQAVGKPGR HNCGACHFNG GGGDAVKHGD LDSSLVKPPK HVDVHMSPDG ENMSCADCHT FNAHQPSGSR YAATAKDTKG LDMKHEDHKR ATCESCHDLR PHKNERLDNH ARRVACQTCH IPEFARGGIA TKVLWDWSTG GKRGPDGKPL FIQDDHGHLI YSAEKGDFKY GENVRPTYKW YNGVVHQIAL TDKIDDSKIL ELNRVEGSAS DPNARIWPFK VMVGKQPYDP VNKTLVVNHV YGQDDTAFWG NFDYAKSIKA GMDYAGLPYS GQFGFIETRM NWFITHMVAP KEDALKCRDC HTRSEDGRLA GITDLYLPGR DRNGWVDLLG GLAIIGAIGA GSIHGIARIL TRRKGH
|
| |