Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1692 |
Symbol | infB |
ID | 7084112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1898645 |
End bp | 1901551 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643698713 |
Product | translation initiation factor IF-2 |
Protein accession | YP_002355343 |
Protein GI | 217970109 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00487] translation initiation factor IF-2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.258602 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAGA TGAGCGTGAC CCAGTTTGCC GGCGAACTGA AAATGCCGGC CGCGGTGCTG CTCGAGCAGT TGAAGCGTGC GGGCGTGGAA AAATCCGGCC CCACCGACCT GCTCACCGAA CAGGACAAGG CGAGGCTGCT GGACTACCTG CGGCGTTCGC ACGGCGACAC CCAGCCCAAG GGCAAGATCA CCCTGACGCG CAAGCAGACC ACCGAGATCC GCGCCACCGA CTCCACCGGG CGTGCGCGCA CGGTGCAGGT CGAGGTGCGC AAGAAGCGCA CCTTCGTCAA GCGTGACGAG CTCGCCGAGC AGGCCGGGGT CGAGGCCGCG GCGGTCGAGC CGGCCGAAAT CGCCGAGGCG GTCGTCGAGG CGCCCGCGCC GGTCACCGAG GCCGCTGCGC CCGCCGTCGA GCCTGCCGTC GAAGCGGTGC CGGTTGTTGC AGCCGCCGCG CCGGAGGTGG TCGAGCCGCC GGCGGCCGAG GCCGTCGAGC CCGTCGCCGA GGCCGTAGCT CCGGTCGTCG AGCCGGCCCC CGCCGTCGCG CCGCCCCCCG CCGCCCCCGC CGCCAGCGCG GCGCGACCGG CCCAGCAGGG CGGTCGCCAT GGTCGTGGCG AGCGTCCGCA GCGCCAGGAG CCCGTCGCGC CCCAGCCGCA GGTCACCACC GTGCTGAGCA AGCCGCTGCC GCTCTCCGAG ATCCTCAGCG AGGAGGAGGT CGCCGCGCGC AAGCGTGACG AGGATCGTCA GCGCGCGCTC AAGGAGCGCC AGGCGGCCGA CCTGCGTGCC CGCCAGGAGC GCGAGGCTGC GGCCAAGGCC GCCGCCGAGG CGCGCAAGGC CGAGGAAGAA GCCCGCCTGC GTGCCGAGCA GCAGAAGAAG GAAGAGCCCG CCAAGGCGGC CGCCAAGCCG ACCACCGGCA CCCTGCATCG GCCCGCCAAG ACCGAGGACA AGCCCGCCAG CCGGGAAGTC AAGCGTACGG CCCCCGCGGC CCCGGCGCGC GAGGCCGACG GCGCCAGCAA GCGCCGCGGC GGCATGAAGA CGCGCGGCGA GGTCGGTGCC ACCACCGGCA GCAACTGGCG CGGCGCCGGC AAGGGTGGCG GCCGTCACGG CCGCAACCAG CAGGACGACC GTTCGTCCTT CCAGGCCCCC ACCGAGCCGA TCGTGCGCGA GGTGCACGTG CCCGAGACGA TCACCGTCGC CGACCTCGCC CACAAGATGG CGGTGAAGGC GACCGAGGTG ATCAAGGTCC TGATGAAGAT GGGCTCCATG GTCACCATCA ACCAGGTGCT CGACCAGGAA ACCGCGATGA TCATCGTCGA GGAGATGGGG CACCTGGCGG TTGCCGCGAA GCTCGACGAT CCGGATGCCT TCCTCGAGGA GAGCGAAGCG CACAAGGATG CCGACGTGCT GCCGCGTGCG CCGGTGGTCA CCGTGATGGG TCACGTCGAC CACGGCAAGA CCTCGCTGCT CGACTACATC CGCCGCGCCA AGGTCGCCGC GGGCGAGTTC GGCGGCATCA CCCAGCACAT CGGCGCGTAT CACGTCGAGA CCGCGCGCGG CATGCTCACG TTCCTCGACA CCCCGGGCCA CGAGGCCTTC ACGGCGATGC GTGCCCGCGG GGCGAAGGCG ACCGACATCG TCATCCTGGT GGTCGCGGCC GACGACGGCG TGATGCCGCA GACGCGCGAG GCGATCCACC ACGCCAAGGC GGCCGGCGTG CCGCTGGTGG TGGCGATCAA CAAGATCGAC AAGCCCGACG CCAACCCCGA GCGCGTGACC CAGGAGCTGA TCGCCGAGTC GGTGATCCCC GAGGCCTACG GCGGCGACAC CATGTTCGTG CCGGTGTCCG CCAAGAAGGG GACCGGCATC GACGAGCTGC TCGAGGCCGT GCTGCTGCAG GCCGAGGTGC TCGAGCTCAC CGCGCCCAAG GACACGCCCG CCAAGGGCCT GATCATCGAG GCGCGCCTGG ACAAGGGCCG TGGCCCGGTG GCCTCGCTGC TGGTCCAGTC GGGAACGCTG CGCAAGGGCG ACGTGCTGCT CGCCGGCGCC ACCTTCGGCC GCATCCGCGC GATGCTGGAC GAAAACGGCA AGCAGATCGA CGAGGCCGGT CCGTCCATCC CGGTCGAGAT CCTCGGCCTG TCGGACGTGC CGTCCGCGGG TGACGAGGCG ATCGCGCTGG CCGACGAGAA GAAGGCGCGC GAGATCGCGC TCTTCCGCCA GGGCAAGTTC CGCGACGTCA AGCTCGCCAA GCAGCAGGCG GCCAAGCTCG AGACGCTCAT GGAGCAGATG TCCGAGGGCG AGGTCAAGAG CCTGGCGCTC ATCATCAAGG CCGACGTGCA GGGTTCGCAG GAGGCGCTGG TGCAGTCGCT GCAGAAGCTC TCCACCGATG AAGTCCGCGT CAACGTCATC CACGGTGCGG TGGGCGCGAT CAGCGAATCC GACGTCAACC TGGCGCAGGC CTCGGGCGCG GTCATCATCG GCTTCAATAC CCGTGCCGAT GCCGGCGCGC GCAAGCTGGC CGAGACCTTC GGTGTCGATA TCCGCTACTA CAACATCATC TACGACGCCG TCGATGAGGT GAAGGCGGCG CTGTCGGGCA TGCTGGCGCC GGAGAAGCGC GAGGAAGTCA CCGGCCTGGT CGAGATCCGC CAGGTGTTCA CGATCTCCAA GGTCGGCTCG ATCGCGGGTT GCTACGTGCT CGAGGGTGTC GTGCGCCGCA ATTCGCACGT CCGCCTGCTG CGCAACCACA CCGTGCTGTG GACCGGCGAG CTCGAGTCGC TCAAGCGCTT CAAGGACGAC GTCAAGGAAG TCAAGTTCGG CTACGAGTGC GGCCTGCAGC TGCGCAACTA CAACGACATC CAGGAAGGCG ACCAGCTCGA GGTCTTCGAG ATCAAGGAAG TGGCGAGGAC CCTGTAA
|
Protein sequence | MEQMSVTQFA GELKMPAAVL LEQLKRAGVE KSGPTDLLTE QDKARLLDYL RRSHGDTQPK GKITLTRKQT TEIRATDSTG RARTVQVEVR KKRTFVKRDE LAEQAGVEAA AVEPAEIAEA VVEAPAPVTE AAAPAVEPAV EAVPVVAAAA PEVVEPPAAE AVEPVAEAVA PVVEPAPAVA PPPAAPAASA ARPAQQGGRH GRGERPQRQE PVAPQPQVTT VLSKPLPLSE ILSEEEVAAR KRDEDRQRAL KERQAADLRA RQEREAAAKA AAEARKAEEE ARLRAEQQKK EEPAKAAAKP TTGTLHRPAK TEDKPASREV KRTAPAAPAR EADGASKRRG GMKTRGEVGA TTGSNWRGAG KGGGRHGRNQ QDDRSSFQAP TEPIVREVHV PETITVADLA HKMAVKATEV IKVLMKMGSM VTINQVLDQE TAMIIVEEMG HLAVAAKLDD PDAFLEESEA HKDADVLPRA PVVTVMGHVD HGKTSLLDYI RRAKVAAGEF GGITQHIGAY HVETARGMLT FLDTPGHEAF TAMRARGAKA TDIVILVVAA DDGVMPQTRE AIHHAKAAGV PLVVAINKID KPDANPERVT QELIAESVIP EAYGGDTMFV PVSAKKGTGI DELLEAVLLQ AEVLELTAPK DTPAKGLIIE ARLDKGRGPV ASLLVQSGTL RKGDVLLAGA TFGRIRAMLD ENGKQIDEAG PSIPVEILGL SDVPSAGDEA IALADEKKAR EIALFRQGKF RDVKLAKQQA AKLETLMEQM SEGEVKSLAL IIKADVQGSQ EALVQSLQKL STDEVRVNVI HGAVGAISES DVNLAQASGA VIIGFNTRAD AGARKLAETF GVDIRYYNII YDAVDEVKAA LSGMLAPEKR EEVTGLVEIR QVFTISKVGS IAGCYVLEGV VRRNSHVRLL RNHTVLWTGE LESLKRFKDD VKEVKFGYEC GLQLRNYNDI QEGDQLEVFE IKEVARTL
|
| |