Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3915 |
Symbol | |
ID | 7873561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4311251 |
End bp | 4313092 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700852 |
Product | cytochrome c biogenesis protein transmembrane region |
Protein accession | YP_002890875 |
Protein GI | 237654561 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCGCC TTTTCCTCCT GCTCGCCCTG ATCGCGAGCC TGCTCCTGCC CGCCGCCGCG CGCGCCAACC CGATCGAGCC CGAGAAGGCC TTCGCGATGC GCGCGCAGGC GCTCGACGCG CAGACCATCG AGGTGGTGTT CGAGATCGCC AAGGACTACT ACCTCTACGG CGACAAGTTC CGCTTCGAGG CCGAGCCCGC CGGCGTGGGC TTCGGCGCGA TGGAAAAGCC CGCGGGCAAG AAGCACAAGG ACGACTTCTT CGGCGAGGTC GAGACCCACC GCGGCGAGCT GCGCATCCTG GTCCCGGTAC AGGCGCCCGC CGGCACCACC CGCTTCGAGC TCTTCGCCAC CAGCCAGGGC TGCTGGGATG GCGGCATCTG CTATCCGCCC ACCACGCAGC AGGCCTCGAT CGACCTCGCC GCGCCGCCGA AGAAGGCGGG CGGCTTCCTC GACAGCGTGC TCGGCGGCCG CGCGGGCACC GGCAGCGCCG CTTCGCCCAC CCCGGTCGCG CTTGCCGCGC CGGAGGACGG CAGCGGCGGC GGCGCCGTCT CCAGCGACGA GAGCGGCGAC ATCGCCCGCC TGCTTTCCGG CGCCAGCGTG CCGATCATCC TGCTGAGCTT CTTCGGCTTT GGCCTGCTGC TCGCCTTCAC GCCCTGCACC TTCCCGATGA TCCCGATCCT GTCGGGCATC ATCGTCGGCC AGGGCAACAA GGTCTCGCAC ACGCGCGCCT TCGTGCTCTC GCTCGCCTAC GTGCTGGGCA TGGCGGTGAC CTACGCGCTC GCCGGGGTCG CCGCCGGCAT GACCGGCACC ATGCTGTCGG CAGCGCTGCA GAACGTGTGG GTGCTGTCGG CCTTCGCGCT GATGTTCGTG CTGCTGTCGC TGTCGATGTT CGGCTTCTAC GAGCTGCAGC TGCCCAGCAC GCTGCAGAGC AAGCTGGCCG ACACCGCCAG TCATGGCAAG GGCGGCCACC TCGGCGGCGT CACCCTGATG GGCGTGCTGT CGGCCCTCAT CGTCGGCCCC TGCGTGGCGG CCCCGCTCGC CGGCGCCCTG CTCTACATCG CGCAGACCGG CGACGCCGTG CTCGGCGGCT GGGCGCTGTT CGCGATGGGC TTGGGCATGG GCGCGCCGCT GCTGGCGGTG GGCGTGGCCT CGCGCAGCCT GCTGCCCAAG GTCGGGCCGT GGATGGAAGG CGTCAAGAAG GCCTTCGGCG TGATGCTGCT CGCGGTTGCG CTGTGGATGA TCACCCCGGT GATCCCGCCC CTGGCCACCA TGCTGGGCTG GGCCGCGCTG CTGCTGTTCT CGGCGATCTT CCTGCACGCG ATCGACCCGC TGCCGCCGCA GGCCAGGGGC TGGCAGCGCT TCTGGAAGGG CGTGGGCGTG GTGCTGTTGT TGGCCGGCGC GGCGATCCTG GTGGGCGCGC TCGCCGGCTC GCGCGACCCG CTGCAGCCCT TGTCGGTGCT GCGCGCGCAG GCCGCGGCGC CGGTCGATGT GCCGCAGTTC GAGAAGGTGG ACTCCATCGC CGAACTCGAG GCCCGGCTCG CCACCACCGA TCGCCCGGTG CTGCTCGACT TCTATGCCGA CTGGTGCGTG TCGTGCAAGG AGATGGAGCG CTTCACCTTC AGCGACGCCG CGGTCGCCGC GCGCATGAGC CGCATGCTGC TGCTGAAGGC CGACGTCACC GCCAACACCG ACGAGCACAA GGCGCTGCTC AAGCGCTTCG GCCTCTTCGG GCCGCCGGGC ATCCTCTTCT TCGACGCTGC GGGCAAGGAA CGCGAGGGCC TGCGCGTGGT GGGCTTCATG AAGGCGGCGC CGTTCGCGAC GGTGCTGGAC CGGGCGCTCT GA
|
Protein sequence | MHRLFLLLAL IASLLLPAAA RANPIEPEKA FAMRAQALDA QTIEVVFEIA KDYYLYGDKF RFEAEPAGVG FGAMEKPAGK KHKDDFFGEV ETHRGELRIL VPVQAPAGTT RFELFATSQG CWDGGICYPP TTQQASIDLA APPKKAGGFL DSVLGGRAGT GSAASPTPVA LAAPEDGSGG GAVSSDESGD IARLLSGASV PIILLSFFGF GLLLAFTPCT FPMIPILSGI IVGQGNKVSH TRAFVLSLAY VLGMAVTYAL AGVAAGMTGT MLSAALQNVW VLSAFALMFV LLSLSMFGFY ELQLPSTLQS KLADTASHGK GGHLGGVTLM GVLSALIVGP CVAAPLAGAL LYIAQTGDAV LGGWALFAMG LGMGAPLLAV GVASRSLLPK VGPWMEGVKK AFGVMLLAVA LWMITPVIPP LATMLGWAAL LLFSAIFLHA IDPLPPQARG WQRFWKGVGV VLLLAGAAIL VGALAGSRDP LQPLSVLRAQ AAAPVDVPQF EKVDSIAELE ARLATTDRPV LLDFYADWCV SCKEMERFTF SDAAVAARMS RMLLLKADVT ANTDEHKALL KRFGLFGPPG ILFFDAAGKE REGLRVVGFM KAAPFATVLD RAL
|
| |