Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3218 |
Symbol | |
ID | 7874439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3518988 |
End bp | 3520361 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 643700152 |
Product | exodeoxyribonuclease VII, large subunit |
Protein accession | YP_002890190 |
Protein GI | 237653876 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1570] Exonuclease VII, large subunit |
TIGRFAM ID | [TIGR00237] exodeoxyribonuclease VII, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCGA ACCCCCTCTT GCCCGCAAAC GGCGCCGCGA CGCCCGCCGC GCAGGTCCTC AGCGTCTCCG AACTCAACCG TATGGCGCGC GAGCTGCTCG AATCCGCGCT GCCCCTGATG TGGGTGGGCG GCGAGATCTC CAACCTCGTC CGCGCCGCCT CCGGCCATGT CTACTTCACC CTCAAGGACG CCTCCGCCCA GGTGCGCTGT GCGATGTGGC GCAACCGCGC GCAGCTGCTC GCCTTCCGCC CCGAGAATGG CATGCGCGTC GAGGCGCGCG CGCTCGTCAC CCTGTACGAG GCGCGCGGCG ACTACCAACT GAGCGTCGAG GCGCTGCGCC CGGCCGGTAT CGGCAGCCTG TTCGAGGCCT TCAACCGGCT CAAGGCCAAG CTCGCCGCCG AAGGCCTCTT CGACGAGGCC GGCCGGCGCG CGCTGCCGCG CTACCCGCGC GCGCTCGGTA TCGTCACCTC GCCGCAGGCC GCCGCGCTGC GCGACGTGCT GGTGACGCTG CGCCGGCGCG CGCCGCATCT GCCGGTGGTG CTCTACCCCG CGCCGGTGCA GGGCGCCGAC GCTCCCGCGC GCCTGCTCGA GGCGGTCCGC TGCGCCGGCC GGCGCGCCGC CGAAGATGGC GTCGATGTGC TGCTGCTGGT GCGTGGCGGC GGCAGCATCG AGGACCTGTG GGCATTCAAC GACGAAGCCC TGGCGCGTAC CCTGCGCGCC TGCCCGCTGC CGGTGGTGTG CGGCGTGGGC CACGAGACCG ACTTCACCAT CGCCGACTTC GCCGCCGACC TGCGCGCACC CACGCCGAGC GGCGCCGCCG AGCTCGCCAG CGCCGGCTGG TACGCTGCGC GCGCCGAACT CGCAGTGCTC GAACCGCGCC TGCGCCGCGC CGTCGAGCGC CGCTTCGGCG AGCTCGCGCA GCGCCTGGAC CGCGCCGCGC TGCGCCTGGT GCATCCGCGC GAACGCCTGC GTCGCGAGCG CGACACGCTG GCGCGCCTGG GCGAGCGCCT GCACCACGCC ACCGCGCGTC GGCTCGAGGC CGCCGACCTG CGCGCCACCC GCGCCGGGCT CCGCCTGCGT GCCGCAGCGC CGCGCCCGCA AGCGCTCGCC GCGCGCGTCG ACATGCTAGG TGGGCGCCTC GCGCGCGCAG CCACACGCCT GCTGGAAGCC CGCAGCCAGC GCCTCGACGC CCTCGCCGCC CACCTCCAGC ATCTCGCGCC ACAGGCGGTA CTGGCGCGCG GCTATGCCAT CGCCCGTGAC AAGCAAGGCC GCGTCCTGCG CAGCACCGCC GGCATCCCCG AGGGCGCCGC GGTGAGCGTG CAGCTCGCGG ACGGCCGCCT CGACACCCGG GTCATCGGCC ACGGGAAGGC CTGA
|
Protein sequence | MPANPLLPAN GAATPAAQVL SVSELNRMAR ELLESALPLM WVGGEISNLV RAASGHVYFT LKDASAQVRC AMWRNRAQLL AFRPENGMRV EARALVTLYE ARGDYQLSVE ALRPAGIGSL FEAFNRLKAK LAAEGLFDEA GRRALPRYPR ALGIVTSPQA AALRDVLVTL RRRAPHLPVV LYPAPVQGAD APARLLEAVR CAGRRAAEDG VDVLLLVRGG GSIEDLWAFN DEALARTLRA CPLPVVCGVG HETDFTIADF AADLRAPTPS GAAELASAGW YAARAELAVL EPRLRRAVER RFGELAQRLD RAALRLVHPR ERLRRERDTL ARLGERLHHA TARRLEAADL RATRAGLRLR AAAPRPQALA ARVDMLGGRL ARAATRLLEA RSQRLDALAA HLQHLAPQAV LARGYAIARD KQGRVLRSTA GIPEGAAVSV QLADGRLDTR VIGHGKA
|
| |