Gene Caul_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0033 
SymbolinfB 
ID5897745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp39009 
End bp42146 
Gene Length3138 bp 
Protein Length1045 aa 
Translation table11 
GC content71% 
IMG OID641560516 
Producttranslation initiation factor IF-2 
Protein accessionYP_001681669 
Protein GI167644006 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0532] Translation initiation factor 2 (IF-2; GTPase) 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00487] translation initiation factor IF-2 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.286863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG AGAACGATAA CGGTCGGCCT GGTTCCAACC CTGGCGGGCG AGCGCCCATC 
ACGCTGAAGC CCCGTCAGGG GTCTGTCAGC GCCGGCGTCG TGAAGCAGAG CTTCAGCCAC
GGCCGCACCA AGACGGTGGT GGTTGAAACC AAGCGCGTGC GCCCGCATGC GCCGCCGGCC
GGCAACCTTG CCGCGCCGTC CTCGGCCGAG CGCCGCCAGG GCGACGCGCC GCGTCAGCAA
TCGTCTTCGG GCGGCGGTGG TTCGTCCGCC GGCGGTCTGT CGCAAGGCGA GATGCTGGCC
CGCCAGCGCG CCATCGAGGC CGCTCGCGAA CACCAGGAAC GCCAAGCCGC CGAGCGCCGC
GCCGCCGAGG CGCGCGCCGC TTCGGAAGCC GCCGCCGCTC GCGACGCCGC GGCCAAGTCG
GCCGCCGCCG CCAAGGCCGC CGCCGCTCCG GCTCCGGAAG CGCCCGCCGC TCCGGCTCCG
ACGCCCGCGC CGGTCGCTCA GGCGCCAGCC GCGCCGGTGG TTCAGGCCCC CGTCGTCGCC
GCCCCGGTCC AGGCTCCGGC CGCTCCGGTC GCCGCCGCGC CGGCCGCGCC GCGCGCCGAG
GCTCCCCGCC CCGCCCCGCC GCCGGTTCGC AGCGATGCGC CGCGTCCGGC TCCGACCGCG
GGTCAGACCC GCACCTACGA GCCCAGCCGC GACCGTCGCG ACGACCGCTC GTCGACCACC
ACCTATCGTC CGGGTCCTGG CGCTCCGCCG CAAGGCGATC GTCCCCAGGG CGACCGTCCG
CAAGGCGATC GGCCGTTCAA CCAGCGCGCC CCGCGCCCGG ATGGCCCTTA CAACCAGCGC
ACCCCGCGTC CCGACGCCGG CGGTCCGCCG CGCGGTCCCC GGCCCGAAGG GGCTGGCGGC
TTCCGCAACG ACCGCCCGCA GGGCGACCGG CCGCAAGGCG ATCGTCCCCA GGGTGACCGT
CCGCAAGGCG ACCGCCCGAC CCAGACCGTG CGCTATTCGG CCCTGGCCCC GCGCCCGGCG
CCCGGCGCCC GTCCGGGTCC TGGCGGTCCA CGCGGCGCCC GTCCGGGCGT CCCGGCCTCG
GCTCCCGCCA CCCCCGAGAT CCAGCGCGCC ACGCGTTCGG CTCCCCGCCC CGGCGGCGAC
GTTGGTCGCA AGCCCGAGGA GGACGATGAC CGCCGCAAGG CCGCGGCTCC CGGCAAGGCC
GTCAGCCGCG CCAAGGGCGC TCCGATCCGT CGGGAAGGTC GCCTGACCAT CCAGACCGTG
GCCGGCGATG GAGACTCCGC CGACCGCATG CGCTCGCTGG CCTCGGTTCG CCGGGCCCGT
GAACGTGAAA AAGAAAAGCG CCGCGGCGGC CCCGCCGATG TGGTCAAGGT CGCCCGCGAA
GTGATTATTC CCGACGTCAT CACCGTGCAG GAGCTGTCCA ACCGGATGGC CGTGCGTGGT
GTCGAGATCA TCAAGTTCCT GATGCGTCAG GGCGTGATGC TGAAGATCAA CGACGTCATC
GACAACGACA CCGCCGAGCT GGTGGCGACC GAATTTGGTC ACACCGTGCG CCGCGTGTCG
GAAGCCGACG TCGAAGAAGG CTTCATCGGC GCCGAGGATG TGGACGATCA CCTGGAGCCC
CGGCCCCCGG TCGTCACCGT CATGGGTCAC GTCGACCACG GCAAGACCAG CCTGCTGGAC
GCGCTGCGCA AGGCCGACGT GGCCAGCGGC GAGCACGGCG GCATCACCCA GCACATCGGC
GCCTACCAGG TGCGTCTGGA AAGCGGCCAG AAGGTCACCT TCCTCGACAC CCCGGGCCAC
GCGGCCTTCT CGCAAATGCG GGCTCGCGGC GCCAATATCA CCGACCTGGT GATCCTGGTG
GTGGCCGGCG ACGACGGCGT CATGCCGCAG ACCGTCGAGG CCATCAAGCA CGCCCGCGCC
GCGGAAGTGC CGATCATCGT GGCCGTCAAC AAGATGGACA AGCCCGGCGC CGACTCCACG
CGGGTCGTCA ACGAGTTGCT GCAACACGAG ATCGTGGTCG AAAGCCTGGG CGGCGACACC
CAGATCGTCG AAGTCTCGGC CAAGACCGGC CAGGGCCTGG ACGAGCTGAT CGAACGCATC
CTGCTGCTGG CCGAGGTCAT GGACCTGAAG GCCAACCCCG ACCGCACCGC CGACGGCGTG
GTGATCGAGG CCAAGCTGGA CAAGGGCCGC GGCGCTGTCT CCACCGTGCT GGTCAAGCGC
GGCACGCTGA AGCGCGGCGA CATTGTCGTG GCCGGCAGCC AGTTCGGCCG GGTTCGCGCC
CTGCTGAACG AGCGCAACGA GCAACTGACC GAAGCCGGTC CCGCCACCCC GGTCGAGATT
CTCGGCCTGG ACGGCGTGCC GTCGCCCGGC GACCCCTTCG CGGTCGTCGA GAACGAGGCC
CGGGCTCGCG AACTGACCGA GTACCGCATC CGCCTGAAGC GCGAGAAATC GATGCATCCG
GTCGGCGCGG GCGCCACCAG CATGGCCGAC ATGATGGCCA AGCTGCAGGA CAAGAAGTAC
CGCGAACTGC CGTTGGTCAT CAAAGCCGAC GTGCAGGGTT CGGCCGAGGC GATCATCGGT
TCGCTGGACG CCATGTCGAC CGACGAGGTC CGCGCACGGA TCATCCTGTC GGGCGCCGGG
GCGATCAGCG AAAGCGACGT GATGCTGGCC AAGGGCGCCG GCGCCCCGCT CATCGGCTTC
AACGTCCGGG CCTCGGCCCA GGCGCGGGCG CTGGCCGAGC GCGAAGGCGT CGAGATCCGC
TACTACGCGA TCATCTACGA CCTGCTGGAC GACATCAAAG GCGTGCTCTC GGGCATGCTG
GCGCCGATCC AGCGCGAAAC CTTCCTGGGC AACGCCGAAG TGCTGCAAGC CTTCGACATC
TCCAAGGTCG GCAAGGTCGC CGGTTGCCGC GTCACCGAGG GCGTGGTCCG CAAGGGCGCC
AAGGTCCGGA TCATCCGCAA CGACATCGTC GTTCTGGAAC TGGGCACCCT GCAGACCCTC
AAGCGCTTCA AGGACGAGGT CCCCGAGGTC CCGTCCGGCC AGGAGTGCGG CATGATGTTC
GCCGGCTTCC AGGACATCAA GGTCGGCGAC ACCATCGAGT GCTTCACCGT CGAAGAGATC
AAGCGCCAGC TCGACTAA
 
Protein sequence
MSDENDNGRP GSNPGGRAPI TLKPRQGSVS AGVVKQSFSH GRTKTVVVET KRVRPHAPPA 
GNLAAPSSAE RRQGDAPRQQ SSSGGGGSSA GGLSQGEMLA RQRAIEAARE HQERQAAERR
AAEARAASEA AAARDAAAKS AAAAKAAAAP APEAPAAPAP TPAPVAQAPA APVVQAPVVA
APVQAPAAPV AAAPAAPRAE APRPAPPPVR SDAPRPAPTA GQTRTYEPSR DRRDDRSSTT
TYRPGPGAPP QGDRPQGDRP QGDRPFNQRA PRPDGPYNQR TPRPDAGGPP RGPRPEGAGG
FRNDRPQGDR PQGDRPQGDR PQGDRPTQTV RYSALAPRPA PGARPGPGGP RGARPGVPAS
APATPEIQRA TRSAPRPGGD VGRKPEEDDD RRKAAAPGKA VSRAKGAPIR REGRLTIQTV
AGDGDSADRM RSLASVRRAR EREKEKRRGG PADVVKVARE VIIPDVITVQ ELSNRMAVRG
VEIIKFLMRQ GVMLKINDVI DNDTAELVAT EFGHTVRRVS EADVEEGFIG AEDVDDHLEP
RPPVVTVMGH VDHGKTSLLD ALRKADVASG EHGGITQHIG AYQVRLESGQ KVTFLDTPGH
AAFSQMRARG ANITDLVILV VAGDDGVMPQ TVEAIKHARA AEVPIIVAVN KMDKPGADST
RVVNELLQHE IVVESLGGDT QIVEVSAKTG QGLDELIERI LLLAEVMDLK ANPDRTADGV
VIEAKLDKGR GAVSTVLVKR GTLKRGDIVV AGSQFGRVRA LLNERNEQLT EAGPATPVEI
LGLDGVPSPG DPFAVVENEA RARELTEYRI RLKREKSMHP VGAGATSMAD MMAKLQDKKY
RELPLVIKAD VQGSAEAIIG SLDAMSTDEV RARIILSGAG AISESDVMLA KGAGAPLIGF
NVRASAQARA LAEREGVEIR YYAIIYDLLD DIKGVLSGML APIQRETFLG NAEVLQAFDI
SKVGKVAGCR VTEGVVRKGA KVRIIRNDIV VLELGTLQTL KRFKDEVPEV PSGQECGMMF
AGFQDIKVGD TIECFTVEEI KRQLD