Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0033 |
Symbol | infB |
ID | 5897745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 39009 |
End bp | 42146 |
Gene Length | 3138 bp |
Protein Length | 1045 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641560516 |
Product | translation initiation factor IF-2 |
Protein accession | YP_001681669 |
Protein GI | 167644006 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00487] translation initiation factor IF-2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.286863 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACG AGAACGATAA CGGTCGGCCT GGTTCCAACC CTGGCGGGCG AGCGCCCATC ACGCTGAAGC CCCGTCAGGG GTCTGTCAGC GCCGGCGTCG TGAAGCAGAG CTTCAGCCAC GGCCGCACCA AGACGGTGGT GGTTGAAACC AAGCGCGTGC GCCCGCATGC GCCGCCGGCC GGCAACCTTG CCGCGCCGTC CTCGGCCGAG CGCCGCCAGG GCGACGCGCC GCGTCAGCAA TCGTCTTCGG GCGGCGGTGG TTCGTCCGCC GGCGGTCTGT CGCAAGGCGA GATGCTGGCC CGCCAGCGCG CCATCGAGGC CGCTCGCGAA CACCAGGAAC GCCAAGCCGC CGAGCGCCGC GCCGCCGAGG CGCGCGCCGC TTCGGAAGCC GCCGCCGCTC GCGACGCCGC GGCCAAGTCG GCCGCCGCCG CCAAGGCCGC CGCCGCTCCG GCTCCGGAAG CGCCCGCCGC TCCGGCTCCG ACGCCCGCGC CGGTCGCTCA GGCGCCAGCC GCGCCGGTGG TTCAGGCCCC CGTCGTCGCC GCCCCGGTCC AGGCTCCGGC CGCTCCGGTC GCCGCCGCGC CGGCCGCGCC GCGCGCCGAG GCTCCCCGCC CCGCCCCGCC GCCGGTTCGC AGCGATGCGC CGCGTCCGGC TCCGACCGCG GGTCAGACCC GCACCTACGA GCCCAGCCGC GACCGTCGCG ACGACCGCTC GTCGACCACC ACCTATCGTC CGGGTCCTGG CGCTCCGCCG CAAGGCGATC GTCCCCAGGG CGACCGTCCG CAAGGCGATC GGCCGTTCAA CCAGCGCGCC CCGCGCCCGG ATGGCCCTTA CAACCAGCGC ACCCCGCGTC CCGACGCCGG CGGTCCGCCG CGCGGTCCCC GGCCCGAAGG GGCTGGCGGC TTCCGCAACG ACCGCCCGCA GGGCGACCGG CCGCAAGGCG ATCGTCCCCA GGGTGACCGT CCGCAAGGCG ACCGCCCGAC CCAGACCGTG CGCTATTCGG CCCTGGCCCC GCGCCCGGCG CCCGGCGCCC GTCCGGGTCC TGGCGGTCCA CGCGGCGCCC GTCCGGGCGT CCCGGCCTCG GCTCCCGCCA CCCCCGAGAT CCAGCGCGCC ACGCGTTCGG CTCCCCGCCC CGGCGGCGAC GTTGGTCGCA AGCCCGAGGA GGACGATGAC CGCCGCAAGG CCGCGGCTCC CGGCAAGGCC GTCAGCCGCG CCAAGGGCGC TCCGATCCGT CGGGAAGGTC GCCTGACCAT CCAGACCGTG GCCGGCGATG GAGACTCCGC CGACCGCATG CGCTCGCTGG CCTCGGTTCG CCGGGCCCGT GAACGTGAAA AAGAAAAGCG CCGCGGCGGC CCCGCCGATG TGGTCAAGGT CGCCCGCGAA GTGATTATTC CCGACGTCAT CACCGTGCAG GAGCTGTCCA ACCGGATGGC CGTGCGTGGT GTCGAGATCA TCAAGTTCCT GATGCGTCAG GGCGTGATGC TGAAGATCAA CGACGTCATC GACAACGACA CCGCCGAGCT GGTGGCGACC GAATTTGGTC ACACCGTGCG CCGCGTGTCG GAAGCCGACG TCGAAGAAGG CTTCATCGGC GCCGAGGATG TGGACGATCA CCTGGAGCCC CGGCCCCCGG TCGTCACCGT CATGGGTCAC GTCGACCACG GCAAGACCAG CCTGCTGGAC GCGCTGCGCA AGGCCGACGT GGCCAGCGGC GAGCACGGCG GCATCACCCA GCACATCGGC GCCTACCAGG TGCGTCTGGA AAGCGGCCAG AAGGTCACCT TCCTCGACAC CCCGGGCCAC GCGGCCTTCT CGCAAATGCG GGCTCGCGGC GCCAATATCA CCGACCTGGT GATCCTGGTG GTGGCCGGCG ACGACGGCGT CATGCCGCAG ACCGTCGAGG CCATCAAGCA CGCCCGCGCC GCGGAAGTGC CGATCATCGT GGCCGTCAAC AAGATGGACA AGCCCGGCGC CGACTCCACG CGGGTCGTCA ACGAGTTGCT GCAACACGAG ATCGTGGTCG AAAGCCTGGG CGGCGACACC CAGATCGTCG AAGTCTCGGC CAAGACCGGC CAGGGCCTGG ACGAGCTGAT CGAACGCATC CTGCTGCTGG CCGAGGTCAT GGACCTGAAG GCCAACCCCG ACCGCACCGC CGACGGCGTG GTGATCGAGG CCAAGCTGGA CAAGGGCCGC GGCGCTGTCT CCACCGTGCT GGTCAAGCGC GGCACGCTGA AGCGCGGCGA CATTGTCGTG GCCGGCAGCC AGTTCGGCCG GGTTCGCGCC CTGCTGAACG AGCGCAACGA GCAACTGACC GAAGCCGGTC CCGCCACCCC GGTCGAGATT CTCGGCCTGG ACGGCGTGCC GTCGCCCGGC GACCCCTTCG CGGTCGTCGA GAACGAGGCC CGGGCTCGCG AACTGACCGA GTACCGCATC CGCCTGAAGC GCGAGAAATC GATGCATCCG GTCGGCGCGG GCGCCACCAG CATGGCCGAC ATGATGGCCA AGCTGCAGGA CAAGAAGTAC CGCGAACTGC CGTTGGTCAT CAAAGCCGAC GTGCAGGGTT CGGCCGAGGC GATCATCGGT TCGCTGGACG CCATGTCGAC CGACGAGGTC CGCGCACGGA TCATCCTGTC GGGCGCCGGG GCGATCAGCG AAAGCGACGT GATGCTGGCC AAGGGCGCCG GCGCCCCGCT CATCGGCTTC AACGTCCGGG CCTCGGCCCA GGCGCGGGCG CTGGCCGAGC GCGAAGGCGT CGAGATCCGC TACTACGCGA TCATCTACGA CCTGCTGGAC GACATCAAAG GCGTGCTCTC GGGCATGCTG GCGCCGATCC AGCGCGAAAC CTTCCTGGGC AACGCCGAAG TGCTGCAAGC CTTCGACATC TCCAAGGTCG GCAAGGTCGC CGGTTGCCGC GTCACCGAGG GCGTGGTCCG CAAGGGCGCC AAGGTCCGGA TCATCCGCAA CGACATCGTC GTTCTGGAAC TGGGCACCCT GCAGACCCTC AAGCGCTTCA AGGACGAGGT CCCCGAGGTC CCGTCCGGCC AGGAGTGCGG CATGATGTTC GCCGGCTTCC AGGACATCAA GGTCGGCGAC ACCATCGAGT GCTTCACCGT CGAAGAGATC AAGCGCCAGC TCGACTAA
|
Protein sequence | MSDENDNGRP GSNPGGRAPI TLKPRQGSVS AGVVKQSFSH GRTKTVVVET KRVRPHAPPA GNLAAPSSAE RRQGDAPRQQ SSSGGGGSSA GGLSQGEMLA RQRAIEAARE HQERQAAERR AAEARAASEA AAARDAAAKS AAAAKAAAAP APEAPAAPAP TPAPVAQAPA APVVQAPVVA APVQAPAAPV AAAPAAPRAE APRPAPPPVR SDAPRPAPTA GQTRTYEPSR DRRDDRSSTT TYRPGPGAPP QGDRPQGDRP QGDRPFNQRA PRPDGPYNQR TPRPDAGGPP RGPRPEGAGG FRNDRPQGDR PQGDRPQGDR PQGDRPTQTV RYSALAPRPA PGARPGPGGP RGARPGVPAS APATPEIQRA TRSAPRPGGD VGRKPEEDDD RRKAAAPGKA VSRAKGAPIR REGRLTIQTV AGDGDSADRM RSLASVRRAR EREKEKRRGG PADVVKVARE VIIPDVITVQ ELSNRMAVRG VEIIKFLMRQ GVMLKINDVI DNDTAELVAT EFGHTVRRVS EADVEEGFIG AEDVDDHLEP RPPVVTVMGH VDHGKTSLLD ALRKADVASG EHGGITQHIG AYQVRLESGQ KVTFLDTPGH AAFSQMRARG ANITDLVILV VAGDDGVMPQ TVEAIKHARA AEVPIIVAVN KMDKPGADST RVVNELLQHE IVVESLGGDT QIVEVSAKTG QGLDELIERI LLLAEVMDLK ANPDRTADGV VIEAKLDKGR GAVSTVLVKR GTLKRGDIVV AGSQFGRVRA LLNERNEQLT EAGPATPVEI LGLDGVPSPG DPFAVVENEA RARELTEYRI RLKREKSMHP VGAGATSMAD MMAKLQDKKY RELPLVIKAD VQGSAEAIIG SLDAMSTDEV RARIILSGAG AISESDVMLA KGAGAPLIGF NVRASAQARA LAEREGVEIR YYAIIYDLLD DIKGVLSGML APIQRETFLG NAEVLQAFDI SKVGKVAGCR VTEGVVRKGA KVRIIRNDIV VLELGTLQTL KRFKDEVPEV PSGQECGMMF AGFQDIKVGD TIECFTVEEI KRQLD
|
| |