Gene Information |
Locus tag | Tmz1t_2568 |
Symbol | |
ID | 7874007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2772290 |
End bp | 2773558 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643699490 |
Product | hypothetical protein |
Protein accession | YP_002889547 |
Protein GI | 237653233 |
COG category | [S] Function unknown |
COG ID | [COG2718] Uncharacterized conserved protein |
TIGRFAM ID | |
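The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length is consistent with that reading) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon codes for one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2772290, 2773558)
print(length)                     # 1269
print(protein_length_aa(length))  # 422
```

Both values agree with the Gene Length (1269 bp) and Protein Length (422 aa) fields of this record.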
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0567655 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCCGCA TCATCGATCG GCGCTTCGAC AGCAAGAACA AGAGCGCGGT GAACCGCCAG CGCTTCATGC GCCGCTTCAA GCAGCAGATC CGCAAGGCGG TATCCGAGGC GATCCACGGC CGCTCCATCC GCGACCTGGA GAACGGCGAG CAGATCTCCA TTCCTGCGCG CGATCTCTCC GAGCCCTCTC TGCACCACGG CAAGGGCGGC ATCTGGGAGC AGGTCTTCCC CGGCAACGAC CAGTTCAGCA CCGGCGACCG CATCAAGCGC CCACTCGGTG GCGCTGGCGA CGGTGCCGGC AAGGGCAAGG CGAGCCAGGA CGGCGAGCAC GAGGACGACT TCGTCTTCCA GCTCTCGCGC GAGGAGTTCC TCGACCTCTT CTTCGAGGAC CTCGAGCTCC CCCGCCTGAT CCGCACCCAG CTAGCCAAGG TCACCGACTA CAAGACGCGG CGCGCCGGCT TCACGTCCGA TGGCGTGCCC GCGAACATCA ACATCGTGCG CTCGATGCGC GGAGCGCTCG GGCGCAGGCT CGCGCTCGGC TCGCCCTGGG CTGCACGCAT CCGCGCGCTG CAGCAGGAAC TCGACGAAGC GCTCGCGCGC GCCGGCGAAG ACAGCGAGGA AGTGCGCGAC CTGCGCGAGG AACTCGCCGC GCTGCGCGCG CGCATCGAAC GCATCCCCTT CATCGACCGC TTCGACCTGC GCTACAACAA CCGCGTCAAG GAGCCCCGCC CCACCACCCA GGCGGTGATG TTCTGCGTGA TGGACGTCTC CGGCTCGATG GACGAGGAGC GCAAGTCCAT GGCCAAGCGG TTCTTCATGC TGCTCTACCT GTTCCTCACG CGCAGCTACG AGCACATCGA GGTCGTCTTC ATCCGTCACC ACACCGTGGC CAAGGAAGTC GACGAGGACG AATTCTTCCA CTCGCGCGAA TCCGGCGGCA CCGTGGTCTC CAGCGCGCTC GAGCTGATGC GCAACATCCT GCGCGAGCGT TACGCCAACG GGCAGTGGAA CGTCTACGGC GCCCAGGCGT CCGACGGCGA CAACTGGGAC AACGACTCGC CGGTGTGCGG ACGCCTGCTC GGCAAGGAGA TCCTGCCCTG GTGCCAGTAC TTCGCCTACG TCGAGATCAC CGCCGGCGAG CCGCAGAACC TGTGGCGCGA GTACGCCAAG CTCGAGGCCG CGCACGACAA CTTCGCGATG CAGCGCATCG AGTCGCCGGC CGACATCTAC CCGGTGTTCC GCGAACTGTT CAAGAAGACG ATCGCATGA
Protein sequence | MVRIIDRRFD SKNKSAVNRQ RFMRRFKQQI RKAVSEAIHG RSIRDLENGE QISIPARDLS EPSLHHGKGG IWEQVFPGND QFSTGDRIKR PLGGAGDGAG KGKASQDGEH EDDFVFQLSR EEFLDLFFED LELPRLIRTQ LAKVTDYKTR RAGFTSDGVP ANINIVRSMR GALGRRLALG SPWAARIRAL QQELDEALAR AGEDSEEVRD LREELAALRA RIERIPFIDR FDLRYNNRVK EPRPTTQAVM FCVMDVSGSM DEERKSMAKR FFMLLYLFLT RSYEHIEVVF IRHHTVAKEV DEDEFFHSRE SGGTVVSSAL ELMRNILRER YANGQWNVYG AQASDGDNWD NDSPVCGRLL GKEILPWCQY FAYVEITAGE PQNLWREYAK LEAAHDNFAM QRIESPADIY PVFRELFKKT IA
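As a rough illustration of how the protein sequence and GC content relate to the gene sequence, the following sketch strips the spacing used in the record, computes the GC fraction, and translates codon by codon. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
# (same assignments for NCBI table 1 and table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGGTCCGCA TCATCGATCG GCGCTTCGAC"
print(translate(prefix))  # MVRIIDRRFD
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` in the same way gives values to compare with the GC content and Protein sequence fields.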