Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0369 |
Symbol | infB |
ID | 4569347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 411989 |
End bp | 414964 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639764967 |
Product | translation initiation factor IF-2 |
Protein accession | YP_910852 |
Protein GI | 119356208 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00487] translation initiation factor IF-2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0109319 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCTTG AGGACATGGA AAAGAGATAC CGGATAAGCG ATCTATCAAG AGAACTCCAG GTGAGTCCAC AGGAAGTCTT GCAGTTTATC AGAATGAACG GGGGCAAGGT TGGTTCAACA TCCTCTATGG TCAATGAAGA GATGCGCGGG ATGATTTTTG GTAATTTCAG TGTTGAAAAG AAAATGGTCG ATGAGGCCAT GAAGATCAGG GCTGAAAAAC AGCGTCGGCT GACAAGGCTT GAAGAGCAGT CGAGAAAAAC GTATGAGAAG GAACAGCAAT TGCGAGATTC CATGCATGTT GCTCCTCTTG TGCCTGTCGC ACCGTTGCAT GTTGCACAAG ATGTGATTGT TGAGGTTGCA GCCCCTCCAT CAGCTCAGGC GGATCATACA GTTCAGGCGG AACCGGCAGT TCAAACTGAA TCCGCAGTTC AAACTGAATC CGCAGTTCAA ACTGAATCCG CAGTTCAAAC GGAACCGGCA GTTCAAACGG AACCGGCAGT TCAAACGGAA CCGGAAGTTC AAACGGAGCC GGCAGCTCAA ACGGAACCGG AAGTTCAAAC GGAGCCGGCA GCTCAAACGG AGCCGGCAGC TCAAACGGAG CCGGCAGCTC AAACGGAATC CGCAGTTCAA ACGGAGGCCG ATCTTTCGGA TGTCGGTGAG GTTTCAATAG TTCCTGAGAA TGTCGAAGTC ATTGACGTTC CCGAATTGCC GATGGTACCG GTCATGCCGG TGAAGGAGGA GCCTTCGGTT AATGATCAGC TTGTATCATT TGATATTCCT CAGAATATCG GAGGGTTGAC GGTTGTTGGA ACCCTGGATA TGATGAATCC TTTTGATCGC AGTGAATCCG GCAAGATGAA GGCGAGAAAA AAGAATTTCA AAGAACAGGC AGACGCCCTG AAATCCGAGT TTGATACTCC CGCGGGAGAA GAAAAGCTTG TTGACGATAA GCTTGTTGTT AAAAAGAAAC CCGTAAAAGC TGCCGGAGAC GGCGACACAG CGCCAGCGGC GGACGACGCT CTTGCAGGAA AAAAGAAGCC GGGCAAGAAG AAGAAAAAAC CTGATGTTGA CGAAAAAGTG ATTTCGGCGA ACATTCGTAC GACTATCAGC GGAATGGATG ATAGCGCAGG ATCGGTATCA CGGCAGAAGT TCCGCAAGAT GCGGCGAATG GAGCGGGAAA AAGAGCATGA GGCCGCTGAA GCTTTTCGCG AGTCGCAGCG AGCGATCGTA AGGGTGACGG AATACGCTTC TCCTCACGAG CTTGCAGAGT TGATGGGAGT TACCGCAAAA GAGATCATAC AGAAATGTTT TGCGCTGGGT AAGTTCGTTA CTATCAATCA GCGTCTTGAT AAGGAGAGTC TCGAACTCAT TGCCCTTGAG TTTGGCTTTG AAGCTGAGTT CATCAGCGAA ATTGAGGCTA CGGCGGTTGT TGCCGAAGTT GACGATGCAG AGGATTTACT GATTCGTCCT CCCGTGGTTA CCATTATGGG CCATGTTGAT CATGGTAAGA CCTCACTGCT TGATTATATC CGTAACAGCA ATGTGGTTGC GGGTGAATCG GGAGGTATTA CCCAGCATAT AGGCGCTTAT GAGGTGACTG TTGAGGGGAA CAGAAAAATA ACCTTCCTTG ATACTCCCGG ACACGAAGCC TTTACTGCAA TGCGAGCAAG AGGCGCACAG GTTACCGATA TTGTTATTCT TGTTGTTGCC GCGGACGACA GCGTTATGCC GCAAACCATT GAGGCAATCA ACCATGCCAA GGCAGCAGGA GTTCCGATTG TCGTTGCGAT CAATAAAATT GATAAACCTG CAGCCAACCC TGAAAAAATC AAAACACAGT TGTCAGAAGC AGGCGTGCTT GTAGAGGACT GGGGCGGTGA GTATCAGTGC CAGGAAATAT CAGCCAAACA AGGTATCGGT ATTGAAGAGC TGATGGGGAA ATTGCTGACA GAGGCGGAAA TTCGTGAACT GAAAGGTAAT TTCTCGGAAG ATGTTCTCGC CAGTGGCATT ATTATCGAGT CTGAACTTGA TAAAGGCAAA GGTGTTATTT CAACGGTTCT TGTTCAGCGA GGATATCTGA GAGTCGGCGA TCCATTTGTA GCAGGGAATA CGATGGGCAG AGTCAGGGCG CTTATGGATG AGCGCAGCAA GAGAATTCAT GAGGCAGGTC CTTCACAACC GGTACGAGTT CTTGGATTTG AAGCACTTCC TCAGTCCGGC GATGTTCTCA CTGTGATGGC CTCCGATCGC GATGCAAGAG AATTGGCTCA GAAAAGGCAG GTGATTCGTC GTGAACATGA GTTCCGTAGA AGCACCAGAG TCAAGCTCGA CAGTATAGCC CGACAGATCA GAGAGGGGCT TATGAAGGAG TTGAGCGTTA TTATCAAGGC TGATACGGAT GGTTCGATCC AGGCCCTTGC CGATGGACTC ATGAAAATTC ATAACGAAGA GGTAAAAGTT CAGATTATTC ATCAGGGTGT CGGGCAGATT ACCGAGACTG ATGTATTGCT TGCTGCCGCA TCTGACGCTA TTATTATCGG ATTCAGGGTG AGGCCGAATG TCAATGCCAA AAAGCTTGCT GAAAAAGAAG ATCTCGATGT TCGTTTTTAC AGTGTTATCT ACCATGTGCT CGAGGATGTC GAAAAGGCGC TTGAAGGAAT GCTGTCACCG GAACTGCACG AGGAGAGCCT TGGATCGCTC GAAATCCGTC AGGTATTCAG AGTGCCGAAA GTGGGCAATG TAGGTGGTTG TTATGCTCTT GAGGGTAAGG TTTTTCGCGA TTCAAAGGTA CGGCTGCTTC GCGACGGGGT TCAGGTGTAC GACGGTCAGC TTGACACGCT TCGACGCTTT AAAGATGATG TCAAGGAGGT TGATGCCGGC TATGAATGTG GTCTCAGTCT GAAAAATTAT GACGATATCA AGGTTGGCGA TATTGTTGAG GCTTATAAGA TTGTCGAGAA AAAAAGAAAA CTCTGA
|
Protein sequence | MALEDMEKRY RISDLSRELQ VSPQEVLQFI RMNGGKVGST SSMVNEEMRG MIFGNFSVEK KMVDEAMKIR AEKQRRLTRL EEQSRKTYEK EQQLRDSMHV APLVPVAPLH VAQDVIVEVA APPSAQADHT VQAEPAVQTE SAVQTESAVQ TESAVQTEPA VQTEPAVQTE PEVQTEPAAQ TEPEVQTEPA AQTEPAAQTE PAAQTESAVQ TEADLSDVGE VSIVPENVEV IDVPELPMVP VMPVKEEPSV NDQLVSFDIP QNIGGLTVVG TLDMMNPFDR SESGKMKARK KNFKEQADAL KSEFDTPAGE EKLVDDKLVV KKKPVKAAGD GDTAPAADDA LAGKKKPGKK KKKPDVDEKV ISANIRTTIS GMDDSAGSVS RQKFRKMRRM EREKEHEAAE AFRESQRAIV RVTEYASPHE LAELMGVTAK EIIQKCFALG KFVTINQRLD KESLELIALE FGFEAEFISE IEATAVVAEV DDAEDLLIRP PVVTIMGHVD HGKTSLLDYI RNSNVVAGES GGITQHIGAY EVTVEGNRKI TFLDTPGHEA FTAMRARGAQ VTDIVILVVA ADDSVMPQTI EAINHAKAAG VPIVVAINKI DKPAANPEKI KTQLSEAGVL VEDWGGEYQC QEISAKQGIG IEELMGKLLT EAEIRELKGN FSEDVLASGI IIESELDKGK GVISTVLVQR GYLRVGDPFV AGNTMGRVRA LMDERSKRIH EAGPSQPVRV LGFEALPQSG DVLTVMASDR DARELAQKRQ VIRREHEFRR STRVKLDSIA RQIREGLMKE LSVIIKADTD GSIQALADGL MKIHNEEVKV QIIHQGVGQI TETDVLLAAA SDAIIIGFRV RPNVNAKKLA EKEDLDVRFY SVIYHVLEDV EKALEGMLSP ELHEESLGSL EIRQVFRVPK VGNVGGCYAL EGKVFRDSKV RLLRDGVQVY DGQLDTLRRF KDDVKEVDAG YECGLSLKNY DDIKVGDIVE AYKIVEKKRK L
|
| |