Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0884 |
Symbol | |
ID | 7084742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 976782 |
End bp | 978284 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 643697907 |
Product | hypothetical protein |
Protein accession | YP_002354547 |
Protein GI | 217969313 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.151139 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGCG AACCCCTGCT CCTCGCCGTG GCTCCGTCCG GCCTCCTCGC ATGCATCGGG AATGACGACG GCATCACCCT CCTCGCTGCC TTCACCTGCG GCGAGGAGGC CGCCTTCGCA AGCTGGCTCG CCCGCCGCCC GCCGGACGAG CCCTGCCGGA TGTTGGTCGA CCTGCCCGAC GAGGCCTACC AGATCGAGGA CCTGCCGCGC GTGCGCGGCA GCGACCGTCG CGCCCTGTTC GCGCGCCGAC TCGCTCACTG GTTCCCCGAG CCGCGCTTCG CACGCGCCAC GCCGCTCGGC GCCCTGCCCG ACGGACGCCA GGGCGCCGAG CGGGTGCTTT TCGCCGGGAT GGAGCGCAGC ACCGAACTCC TGCCCTGGCT GGATCGCCTC GCGGCCGACG GACGCCGCCC CCAAATCCTC GTCCCCGCCA GCGCCCTCCT GCCACGCCTG CCCCTGCCAG GAGCACGCCA GCGCCGCCAC GGCAAGGCGC CGCCACGGCC GCGCCTGCTC GCCACCCACG GGCGCGCGGG CCTGCGCATT TCGCTGCTGG CCGGCGAACA CACCCTGTTC TCCCGCCTCG TGCGCGGCCA CGCCGACTCC CTCGCCGATC CGCAAGCGCT GGCGACGGAG CTCGAGCGCA CCCGCGACTA CCTGGCCGCC CAGCACCGCA TCGCCAGCGA TGCGCCACTC GATCGCATCG TCGCGGACCT TCCGGCTCAG ACCGGGACGG AGCACGCCAC ACTGCCTCCG ATCGGCGTCG CCGCCGCCGA CACGCTCGCG GGCTGCGGAC TCCCGAACAG CTACGACGCC CATTTGCTGC TCGCCCTGCG CCACGCCCCC ACGACGATCG GCTGGCCGCT CGCGGCCGGC GCACGGCGCT GGCCCAGGCT GCCCGAAAGG CGCGTGCTCG GCGCGCTCGC GCTCGTCGCC TCGACGATCC TGGGAGCCCT CGTCTGGATG GAGCACCAGG CGCAGGTCGA GGCCGCAGCC CTGGCCGCGA CCGAGCGCGC CCGTGCGGCG CGTGCCGCCG CGCTTGCGGC CGAGGAGGCG GAACTCGCCG CACGCGAGGC CGAACGCGCC GCACTCGCCG CCCTCGATGC AGCGCCGCCG CCCCTTCCGA CCCCAGCGCC GGCCCCCGCC GAGCCTGCAC CGGCAGCCTG TCCGCCTGCC TCCAGCCCGC CGCCCGGGCC CATCGCCAGG CGCATCGACG GCGTCCTGCG TCGCCCGGAT GGCGAGATCC TGCTCTGGGT GGAGGGAACC TGGCAGTCCG CGCGCGCACT CGGCCTGCAC CCGATCGCAG GCGACGCCGC GGTGGTAGTG GCGGCCGGCC GGCGCACGCA GCTACGCAGC GGAGAGCTCG TGCCGACCGC CGTCGCGACG GTCACGGCCG GGCCTGGGCA GGCCTACCCC GAGCGCGACG ATGCGGCACC CGCAGATCGA GCAGCGGGCC CCGCCGACAG CGCTGCAAGG CCCGGTGTCG ACATGCGCGG AGCGCGGCCA TGA
|
Protein sequence | MRREPLLLAV APSGLLACIG NDDGITLLAA FTCGEEAAFA SWLARRPPDE PCRMLVDLPD EAYQIEDLPR VRGSDRRALF ARRLAHWFPE PRFARATPLG ALPDGRQGAE RVLFAGMERS TELLPWLDRL AADGRRPQIL VPASALLPRL PLPGARQRRH GKAPPRPRLL ATHGRAGLRI SLLAGEHTLF SRLVRGHADS LADPQALATE LERTRDYLAA QHRIASDAPL DRIVADLPAQ TGTEHATLPP IGVAAADTLA GCGLPNSYDA HLLLALRHAP TTIGWPLAAG ARRWPRLPER RVLGALALVA STILGALVWM EHQAQVEAAA LAATERARAA RAAALAAEEA ELAAREAERA ALAALDAAPP PLPTPAPAPA EPAPAACPPA SSPPPGPIAR RIDGVLRRPD GEILLWVEGT WQSARALGLH PIAGDAAVVV AAGRRTQLRS GELVPTAVAT VTAGPGQAYP ERDDAAPADR AAGPADSAAR PGVDMRGARP
|
| |