Gene Information |
Locus tag | Tmz1t_3872 |
Symbol | |
ID | 7873523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4268323 |
End bp | 4270551 |
Gene Length | 2229 bp |
Protein Length | 742 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643700814 |
Product | cytochrome C family protein |
Protein accession | YP_002890837 |
Protein GI | 237654523 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01905] doubled CXXCH domain |
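
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the record does not state either convention. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(4268323, 4270551)
print(length)                  # 2229
print(protein_length(length))  # 742
```

Under these assumptions both results agree with the Gene Length (2229 bp) and Protein Length (742 aa) fields.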
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.817906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCAAC TCGCCGTCTT CATTCTTGCC CTGCTGTTGG CCCCCTGGTC GGTGGCTGGC TCCGGGTATG TCGGCAGTAC CCGCTGCGTC GCCTGCCATC AGGCGGAGGC CGAGGCCTGG CGCGGCTCGC ATCACGAGTT GGCGATGGCC GTGGCGGCAA CCGACAAGGT CCTGGGCGAC TTCAACGACG CCACCTTCAC CGCGCATGGC GTGACCTCGC GCTTCTATCG CAAGGACGGC AGCTATTTCG TGCGCACCGA TGGCCCGGAT GGCAAGTTGC AGGACTACCG GATCAAATAC ACCTTCGGCT GGACGCCGTT GCAGCAGTAC CTGATCGAGT TGCCCAATGG CCATGTCCAG GCGCTGGGCA TCGCCTGGGA TAGCCGGCCG GCGGCCGTCG GCGGCCAGCG CTGGTTCCAT CTTTACCCCA ATGAGCCGAT GGATCACCGC CATCCGCAGC ACTGGACGGC GCGCAGCCAG ACCTGGAACC ATCAATGTGC CGAGTGTCAC TCGACCAATC TGCAGAAGAA CTACGACCTG GCGGCCGATC GTTACCGGAC GACCTGGAGC GAGATCAACG TCGCCTGCGA GGCCTGTCAC GGTGCCGGCG GCAAGCATGC CGACTGGGCC GCACTGCCGG CGGCAAGGCG TCCGGCGGGC GACAAGGGAC TGACGGTTTC GCTGGCTGCC GCGGCCACCA CCACCTGGGC CTTCGATCCG GCCAGCGCTG CAGCGCAGGT CGAGGCCTGT GCCCGCTGCC ATTCGCGGCG CGGGCCGATC TGGTCCGATG ACGGCGGTGG CCGCCCGCTG GGCAACAGCC ATCGCCTGGC CCTGCTCGAG GAGCGGCTGT ACTTTGCCGA CGGCCAGATC AAGGACGAGG TCTTCGAGTA CGCCTCCTAC ACGCAGAGCC GGATGCATGC GGCCGGCGTC GCCTGCACCA ACTGCCACGA GCCGCACAGC CTGAAGCTGC GGGCCGAGGG CAATGCCCTG TGCGCCAGCT GCCATCCGGC GGCACGCTAC GACACCCCGG CGCACCACCA TCACCCGGCG GGCAGCCCGG GGGCCAGTTG CACCAGCTGC CATATGCCGC AGCGGGCGTA CATGGTCCAT GACTGGCGCG CCGACCACAG CATCCGTGTG CCGCGTCCCG ATCTCTCCGT CAGGCTGGGC ACGCCGAACG CCTGTGCCGG ATGCCATGCG CAGCAGGGCC ACGAATGGGC GGCCCGTGCC CTTTCCCGGT GGTATCCCGA GAGCCGGATG CGCGGGCCGC ATTTTGCCGA AGCCTTCCAT GCGGCCGCCA CCGGTGCGGC CGACGGGGCT GCCCGCTTGC TGGCGGTGGC GAGCGATCCG CAGCAGCCGG CGATTGTCCG CGGCAGCGCC GCCAGCCGCC TGGCCGGCCT GGGGGCGGTG CCGCCGACGC CCGAGCTGCA GGCCTTGCTG GCGGATCGGC AGCCGCTGGT GCGCGCCGCC TCGTTGCGCT TCCTCGAGGT GGCGGATGCG CGCACCCGCT TCGAGCAGGG CTGGAGCAGC CTGCGCGACA GCGAGCGGAC GGTACGCCTC GAGGCTGTCC GGGTTCTTGC CCCGCTGCTG CGCGAGCGAC TGCCGGCGGC CCAGCGGGAG GAACTGCTGC GCGGCGTGGC CGAGTACGAA GCTTCGCTTC AGGTCAACGC CGATCTGCCC GAGAGTCATG TCAGCCAGGG GCTGCTCGCC CTGTCGATGG GCGACGGCGA GCAGGCGGAA CAGGCCTACC GGACGGCACT GCGGCTGGAT GCTCGTTTCG TCCCGGCCTA TGTCAACCTG 
GCCGACCTTT ACCGCCTGCA GCAGCGCGAA GGCGAGGGCG AGCACCTGCT GCGCGAGGGC ATCGACAGGA TCACCTTCGA TGCCGACCTG CGCCATACCC TCGGCCTCAA TCTGATCCGC CAGCAACGCC GCGGCGAAGC CCTGCAGTGG CTGCGCGAGG CTGCCGAAGC GGAAAGCGCC AATGCCCGCT ACAGCTATGT CTATGCCCTG GCCCTGCAGG GCAGTGGCGA CGGGGTAGGC GCCCTGCGCA TCCTGCGTCA GGCGCAATCG CAGCATCCGG GCAATCGCGA TGTCCTTTTC GCCCTGGCGA CGATCAGTCG CGACCAGAAG GACATGGTCA GGGCGCGCGC CTATGCCGAG GAATTGCTCG AGCGCTTCCC GGGGGACCGG CAAGCCAAGG CTTTGTGCGA GACCTTGCGG GAGCGATGA
|
Protein sequence | MRQLAVFILA LLLAPWSVAG SGYVGSTRCV ACHQAEAEAW RGSHHELAMA VAATDKVLGD FNDATFTAHG VTSRFYRKDG SYFVRTDGPD GKLQDYRIKY TFGWTPLQQY LIELPNGHVQ ALGIAWDSRP AAVGGQRWFH LYPNEPMDHR HPQHWTARSQ TWNHQCAECH STNLQKNYDL AADRYRTTWS EINVACEACH GAGGKHADWA ALPAARRPAG DKGLTVSLAA AATTTWAFDP ASAAAQVEAC ARCHSRRGPI WSDDGGGRPL GNSHRLALLE ERLYFADGQI KDEVFEYASY TQSRMHAAGV ACTNCHEPHS LKLRAEGNAL CASCHPAARY DTPAHHHHPA GSPGASCTSC HMPQRAYMVH DWRADHSIRV PRPDLSVRLG TPNACAGCHA QQGHEWAARA LSRWYPESRM RGPHFAEAFH AAATGAADGA ARLLAVASDP QQPAIVRGSA ASRLAGLGAV PPTPELQALL ADRQPLVRAA SLRFLEVADA RTRFEQGWSS LRDSERTVRL EAVRVLAPLL RERLPAAQRE ELLRGVAEYE ASLQVNADLP ESHVSQGLLA LSMGDGEQAE QAYRTALRLD ARFVPAYVNL ADLYRLQQRE GEGEHLLREG IDRITFDADL RHTLGLNLIR QQRRGEALQW LREAAEAESA NARYSYVYAL ALQGSGDGVG ALRILRQAQS QHPGNRDVLF ALATISRDQK DMVRARAYAE ELLERFPGDR QAKALCETLR ER
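
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The sketch below is one possible way to strip that grouping, compute a GC fraction for the gene sequence, and list CXXCH motifs (the motif named in the TIGRFAM entry) in the protein sequence. The helper names are hypothetical, and the simple pattern `C..CH` is an assumption about how to spot the motif, not the TIGRFAM model itself.

```python
import re


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for block-of-ten grouping."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 to 1.0)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def find_cxxch(protein: str) -> list:
    """Return (1-based position, motif) pairs for every C-x-x-C-H match.

    A lookahead is used so that overlapping matches are also reported.
    """
    seq = clean_sequence(protein)
    return [(m.start() + 1, m.group(1)) for m in re.finditer(r"(?=(C..CH))", seq)]


# First two blocks of the gene sequence and first four blocks of the protein.
print(gc_fraction("ATGAGGCAAC TCGCCGTCTT"))
print(find_cxxch("MRQLAVFILA LLLAPWSVAG SGYVGSTRCV ACHQAEAEAW"))
```

Passing the full Gene sequence and Protein sequence fields to these functions gives values that can be compared with the GC content field and the TIGRFAM annotation.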