Gene Information |
Locus tag | Tmz1t_4070 |
Symbol | |
ID | 7873297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4469301 |
End bp | 4470830 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643701001 |
Product | HNH endonuclease |
Protein accession | YP_002891024 |
Protein GI | 237654710 |
COG category | |
COG ID | |
TIGRFAM ID | |
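The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either assumption explicitly. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 4469301
END_BP = 4470830


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1530 509
```

Under these assumptions the results match the listed Gene Length (1530 bp) and Protein Length (509 aa).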
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCCGCAC CTCCCTCTCC TGCGTTCGCG CGTGCCTGGC GCCTGTCCGC CATCCTGGTG CTCGGCTTCG CCTCCGGCCT GCCGTTGGCG CTGACCGGCC AGGCCATGCA GGCCTGGCTG ACAGTCGACG GCGTGGACCT CGCCACGATC GGTTTTTTCG GCCTGGTCGG CGTCCCGTAT ACCTTCAAGT TCCTGTGGGC GCCGCTGATG GACCGCTTCG AGCCGCCCTG GCTGGGGCGC CGGCGCGGCT GGCTGGCGCT GACGCAGCTC GCGCTGGCGG CGCTGCTGTG GTGGATGGCG AGCCTGTCGC CGACGGCCAC GCCGGGCCTG TTCGCCATCG CGGCGGTGGC GATCGCCTTC CTCTCGGCCT CGCAGGACGT TGTGGTGGAC GCCTACCGCA CCGACCTGCT GCCCGAGGCC GAGCGCGGGC TGGGCGCCTC TGTGCACGTC TTCGCCTACC GCCTGGCGAT GATCCTGTCC GGCGGCATCG CGCTGATCTG GGCCGGGCAG TGGGCGTCGT GGCCGCGGGT ATACGAGACC ATGGCGCTGA TCATGGCGGC CTGCGCGGTG GTGTCGCTGC TGGCGCTGCC GCGCGTGTCG GCGGCGCTGA AGCCGCTCGA TTCCGACCCC AGGCGCGAGC TGCTCGGCTT CGCCGCGATG CTCGCCGGGG TCGCCGCCGG CTACTGGAGC GCCCGCCAGG CGCTGATCCT GCTCGGGCTC GACCCGAACG ACGCCAACCG CTGGATCCAG CTCCTGTTCG TGATGGCGGA GATCGCGCTC GCGCTGCCGC TGGCGGGCTG GGCGGCGCGT CGTGCCGGCT TCGAGACGCT CAACCGCTCG CTGTCGAGCT ACTTCGCGCA GCAGGGCGCG GCGGCCTTCC TGGCGCTGAT CATCCTCTAC AAGCTCGGCG ACGCCTTCGC CGGCAGCCTG ACCACGCCCT TCCTGATCAA GGGCATGGGC TTCTCGCAGG AAGAGGTCGG CATCGCCAAC AAGGTGATCG GCATCTGGCT GACCATCCTC GGCGCCTTCA TCGGCGGGCT GATCATGACG CGGCTGGCGC TGTACCGCTC GCTGCTGCTG TTCGGCGTGC TGCAGCTGGT GTCCAACTTC GGCTTCTACC TGCTCGCAGA GCTCGGCAAG GGCGCCTGGG GCGCAGTCAT GGTGCCGGCC TTCGACTGGG GCTTCGTGGC GATCGACACG CCGGCCGCGC TCGACTGGCT GCTGCTGACC GTGATCGCCG GCGAGAACAT CAGCGGCGGC ATGGGCACGG TGGCCTTCGT CGCGCTGCTG ATGGGGCTGT GCAACCAGCG CTTCACGGCG ACCCACTACG CCATGCTGTC GGCCTTCGCC GCAGTGGGGC GGATCTACGT CAGCCCGCTG TCGGGCGTGC TGTCGCAGAG CATCGGCTGG CCGGCCTTCT TCCTCTTCTC GATCGTGGTC GCCGTACCGG GCGTGGTGAT GGTGTGGTGG CTGCGCGACG CGCTCGCGCG CCTCGGCCGC CCGCAGACCG ACGGCATGGT GGACGACTGA
|
Protein sequence | MSAPPSPAFA RAWRLSAILV LGFASGLPLA LTGQAMQAWL TVDGVDLATI GFFGLVGVPY TFKFLWAPLM DRFEPPWLGR RRGWLALTQL ALAALLWWMA SLSPTATPGL FAIAAVAIAF LSASQDVVVD AYRTDLLPEA ERGLGASVHV FAYRLAMILS GGIALIWAGQ WASWPRVYET MALIMAACAV VSLLALPRVS AALKPLDSDP RRELLGFAAM LAGVAAGYWS ARQALILLGL DPNDANRWIQ LLFVMAEIAL ALPLAGWAAR RAGFETLNRS LSSYFAQQGA AAFLALIILY KLGDAFAGSL TTPFLIKGMG FSQEEVGIAN KVIGIWLTIL GAFIGGLIMT RLALYRSLLL FGVLQLVSNF GFYLLAELGK GAWGAVMVPA FDWGFVAIDT PAALDWLLLT VIAGENISGG MGTVAFVALL MGLCNQRFTA THYAMLSAFA AVGRIYVSPL SGVLSQSIGW PAFFLFSIVV AVPGVVMVWW LRDALARLGR PQTDGMVDD
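The sequences above are printed in space-separated blocks of ten, and the gene begins with TTG although the protein begins with M. The sketch below shows one way to strip the block spacing, compute GC content, and translate the CDS. It relies on two facts about NCBI translation table 11: its codon-to-amino-acid assignments are the same as the standard code, and TTG is one of its permitted start codons, read as methionine at the first position. The helper names are illustrative and do not come from the database.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        # An alternative start codon at position 0 is read as methionine.
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("TTGTCCGCAC CTCCCTCTCC TGCGTTCGCG"))  # prints: MSAPPSPAFA
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_percent` can be compared with the listed GC content.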