Gene Information |
Locus tag | Tmz1t_2255 |
Symbol | |
ID | 7083687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2541659 |
End bp | 2542573 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643699274 |
Product | phage SPO1 DNA polymerase-related protein |
Protein accession | YP_002355890 |
Protein GI | 217970656 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.10497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCCGC GCCTGTCCTC CCGCCGCGAC GCGGTGCTGC GCGAGATGGG GCTCGGCCCG ATCTGGCGCC TGCGCACGCG GAAGGCGGAG GTCTCCGAAG CGGATGATGC GGAGCGTGAA GCGGCCGAGC CTGCATCGAC CGAAACCGAC GCGACCGCCT CGCCCCGGGT CGAGGTGCCG CGTACGCCGA ATCCTCCGCC AGCCGCACCG GCCCCGGCTG CCGGACCGCG ATCGAGCGTT GCGTTCCAAG CCGGCCCGCA CCGTGCCGCG CGCCCCCAGC CCGCGGCCGC TTCGACGCCC GCAGCCGTTC GCAGTGCGCC TGCGGCGCCG GATCCGGCTC GCGGCGCCCG CATCTCCACG CTCGAATGGG ACGCGCTCGA AGCCGAGATC CGCGATTGCA AGGCCTGTGG GCTGTGCGAG CGCCGCAAGC AGGCCGTGCC CGGCGTGGGC GACCGCCAGG CACGCTGGAT GCTGGTGGGC GAGGCGCCGG GCGCGGAGGA GGACCAGCGT GGCGAGCCCT TCGTCGGCCA GGCCGGGCGC CTGCTCGACA ACATGCTCGC GGCGATCGGG CTGAAGCGCG GAGAAGACGT CTATATCGCA AATGCGGTCA AGTGCCGACC GCCACACAAC CGCACTCCAG AACGCGGCGA GATCGCCGCC TGCCAGCCGT ATCTCGATCG CCAGATCGCG CTCGTGCAGC CGCAGCTGCT GGTCGCGCTC GGCCGTCCGG CCGCCCAGGC CCTGCTCGAT CGCGAGATCG CAATCTCGGC CGCGCGCGGC AAGCGCTTCG AGCGCGCGGG CACTCCGGTC GTGGTCACCT ACCATCCGGC CTATCTGCTG CGCAATCCGC AGGACAAGGC CAAGGCATGG GAAGACCTCT GCTTCGCGCG CCGGCTGATC GCCGAGACGG GGTGA
|
Protein sequence | MSPRLSSRRD AVLREMGLGP IWRLRTRKAE VSEADDAERE AAEPASTETD ATASPRVEVP RTPNPPPAAP APAAGPRSSV AFQAGPHRAA RPQPAAASTP AAVRSAPAAP DPARGARIST LEWDALEAEI RDCKACGLCE RRKQAVPGVG DRQARWMLVG EAPGAEEDQR GEPFVGQAGR LLDNMLAAIG LKRGEDVYIA NAVKCRPPHN RTPERGEIAA CQPYLDRQIA LVQPQLLVAL GRPAAQALLD REIAISAARG KRFERAGTPV VVTYHPAYLL RNPQDKAKAW EDLCFARRLI AETG
|
| |
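The coordinate, length, and GC fields in this record can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates. The gene sequence shown above ends in the stop codon TGA, so the protein length is taken as the number of codons minus one.

```python
# Values copied from the record above.
START_BP = 2541659
END_BP = 2542573
GENE_LENGTH_BP = 915
PROTEIN_LENGTH_AA = 304


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(sequence):
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Coordinates should reproduce the reported Gene Length.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Gene Length should reproduce the reported Protein Length.
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # GC fraction of the first two 10-base blocks of the gene sequence.
    print(gc_fraction("ATGTCGCCGC GCCTGTCCTC"))
```

Passing the full Gene sequence field to gc_fraction gives a value that can be compared with the reported GC content of 73%.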