Gene Caul_0321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0321 
Symbol 
ID5897595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp363036 
End bp365840 
Gene Length2805 bp 
Protein Length934 aa 
Translation table11 
GC content65% 
IMG OID641560805 
ProductTonB-dependent receptor 
Protein accessionYP_001681956 
Protein GI167644293 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0815676 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGTC GTCCATCACA CCGGGGGGCC ATCATGGCCG GCGCGGGCAT TGTCGTCCTG 
CTCACCGCGA GCGCGGCGAT CGCGCAGGTT TCCGCCAGCG CCCCTGATGA CGAGACGGTC
GAGGAAGTCG TCGTCGTTGG CGTGCGCGCC AGCGTGGCCA AGGCCGTCAA GCTCAAGCGC
GACGCCACCA CGGTTCAAGA CTCGATCAGC GCCCTGGAAC TGGGCATGTT CCCGGACGAC
AACGTCGCCG ACTCCCTCAG CCACATCACC GGCGTGTCGA TTTCGCGCAC CGCCGGCGGC
GAGGGCCAGA AGGTCAGCGT CCGCGGCCTT GGTCCCGAAT ATACGCTGTC GACCTTCAAC
GGACGCATCC TGGCCACCGA TGGCGCGGGC CGTGACTTCG CCTATGATGT GCTGCCAGCC
GATGTCATCA GCGGCGCCGA CGTGATCAAG GGCGCCGAGG CGGCCAACAC CGAAGGCGCC
ATCGGCGGCC TGATCAACCT GCGCTCGGCG AGCCCCTTCG ACAGGCGCGG CCAGCAGGGC
ATCGTGCGCC TCGAGGGCGA CCGCAACCAG ATGTCGGAAC TGGACGGTCG CAAGCTGTCG
GCCGTCTACA GCAACACCTT CGCCGGCGAC ACGCTGGGCC TGCTGCTCGG CGTGGTCCTC
GAAAAGCGCG ACGACCGCAC CGACGTGGCC GGCAATGACG GCGGCTGGAC GCGCAACGCG
GATCCGACCG ACGTAAGCTG GCTGTATGGC AACGCCTGGG GCGGCTCCAT CGACCCGAAC
AACAACGGCG TGCTGGACCC CGAGGAATAC GGCCTGATCG GTCCGGGTCA GTTCCGCGTC
GGTTCAATCC TGGAAAAGAA GAAGCGCCAC GCCTTCACCG GCAAGCTTGA GTGGCGGCCG
TCCGATAACT TCAAGCTGGT CGTCGATGGA CTGAGCACCA AGCTCGATTC CCCCCAGGTC
GGCTACCAGC AGTCCTTCTA CCCGCTCTAC GCGCCGGGCC GCTGGTCGAA CATGGTGGTC
AAGAACGGCA TCGTCACCAG CTTCGACATG AACAATCCCG ATCCCGAGAT GCGTCTCAAT
CCGGAGCTGC TCAACAAGAC CGAGTTCCGC GTCGTCGAGA CGCAGCTCTA TGGCGCCAAC
GCCGAATGGA AGGTGTCGGA CACCCTGACC ATCACGGGCG ACGTCTACCG CTCGACCTCC
AAGCGCCACT CGGGCGGGCA GGACAGCTAT GTCGTCCTGC GCATGAACCA GCCCAACAGC
GCCCATATCG AGCTGACCGG GGAACGCGTG CCCAACGTCA CGGTCAATTT CGACGACGGC
CGCGACCTGG CCAGCGGCCT GGAGAAGGGC CTGTTCCACG ACTCCGACTT CAACACCCAC
TACTTCTCGC TCGCCGGCGA CAATATCGAC GACAAGATCA CCGGCGCCTC GTTCAAGGGC
GCCTGGGCGA CAGGGCGCGG CTGGCTCGAC AACGTGCAGT TCGGGGTCAA CTACACCGAC
CGCAAGAAGT CGCGCGACCT GGTCAACAAT GCGCTGACCG GCGGGGCCGA CTACTATTCC
GGCGACTACG CGATCAACGT GGGCGCGCTC GGCGGCAACG TCATTTCCGA CAGCTTCTCC
CTGCCCCACT TCATGAGCGA GGTGGACTCC AAGTTCCCGC GCACGTTCCT GTCCTTCGAC
ATCCCCAAGT ATCAAGCGGC GCTCGCGGCC TATAACGGCA AGCCACGCCC GGGCGGCGGC
ACGTACGACT ATTCCAAGGC CGCGCCGGCC TGGAATCCCC TGCAAAGCTA CCGCGTGGGC
GAAGAGACCT GGGCCGGCTT CGTGCAGGCC AATCTGGAGG GCAAGCGCTG GAGCGGCAAT
GTCGGCCTTC GCGTCGTGCG CACCAAGACC AACGCCCAGG CCTGGGACGC CAAGATCCTG
CAAGTCATCG AAAACGGCGC GTTCAACTAC ACGGCTGTCT ACGCCGCGCC GACCTCCATC
GAGCAGAGCA ACACCTACAC CTACGCGCTG CCGTCGCTGA ACCTCAACTA CCGCTTCACC
GAAGAGCTGC GCCTGCGCTT TGGCGCGGCC AAGACCATGG CCCGTCCGTC GGTCGCGACC
CTGGCGCCGA CCAACACCAC CGAAAGCGTC TCGTGGGGCG AGTTCACCCA GATCTACAGC
GGCAACGCCG AGCTCAAGCC CTATCAGGCC AAGCAGTTCG ACCTGTCGCT GGAATACTAT
TTCCGCCCCA ACTCGGTGTT CAACGTGGCG GTGTTCCACA AGCACATCAC CGACCAGATC
ACCACCAGCT GGGAACCGGG CCAGGATATC GGCGTACCCG GCCACCTGTT CAACATCAGC
CGTCCGATCA ACGGCGACTA CGCCAAGGTG AAGGGCGTCG AGGTCGGCCT GCAGCACTTC
CTGGACAACG GCCTGGGCGT GCGCGCGCAA TATACGCGTA ACTGGGCCAA GAGCTGGGTT
GGTGACCAGG AGCGTCCGCT CGAAGGCATC GCGCCCTCGG TCTATTCGCT GGGCGTCTTC
TACGACCACG GTCCGGTGTC GCTCAGCCTG TCGGGCGATC ACACCGCGGG CTTCACGACG
GCCGTCAACG TGCTCGGCGC CGGTTACAAC GAGAAGGCCG ACGCGATCAC CTGGGTCACG
GCCCACGCGT CGTACAAGAT CAACGACAAG ATGGACATCT CGCTCGAAGG CCAGAACCTT
CTGGACGAGG CCAACACCTA CAGCATCAAC GGCAACTCGA TGCTGCCGCA GGGCTACTAT
CGCTACGGGG CCAGCTACAA GCTGGGCCTG AGCTACCGCT TCTAG
 
Protein sequence
MSRRPSHRGA IMAGAGIVVL LTASAAIAQV SASAPDDETV EEVVVVGVRA SVAKAVKLKR 
DATTVQDSIS ALELGMFPDD NVADSLSHIT GVSISRTAGG EGQKVSVRGL GPEYTLSTFN
GRILATDGAG RDFAYDVLPA DVISGADVIK GAEAANTEGA IGGLINLRSA SPFDRRGQQG
IVRLEGDRNQ MSELDGRKLS AVYSNTFAGD TLGLLLGVVL EKRDDRTDVA GNDGGWTRNA
DPTDVSWLYG NAWGGSIDPN NNGVLDPEEY GLIGPGQFRV GSILEKKKRH AFTGKLEWRP
SDNFKLVVDG LSTKLDSPQV GYQQSFYPLY APGRWSNMVV KNGIVTSFDM NNPDPEMRLN
PELLNKTEFR VVETQLYGAN AEWKVSDTLT ITGDVYRSTS KRHSGGQDSY VVLRMNQPNS
AHIELTGERV PNVTVNFDDG RDLASGLEKG LFHDSDFNTH YFSLAGDNID DKITGASFKG
AWATGRGWLD NVQFGVNYTD RKKSRDLVNN ALTGGADYYS GDYAINVGAL GGNVISDSFS
LPHFMSEVDS KFPRTFLSFD IPKYQAALAA YNGKPRPGGG TYDYSKAAPA WNPLQSYRVG
EETWAGFVQA NLEGKRWSGN VGLRVVRTKT NAQAWDAKIL QVIENGAFNY TAVYAAPTSI
EQSNTYTYAL PSLNLNYRFT EELRLRFGAA KTMARPSVAT LAPTNTTESV SWGEFTQIYS
GNAELKPYQA KQFDLSLEYY FRPNSVFNVA VFHKHITDQI TTSWEPGQDI GVPGHLFNIS
RPINGDYAKV KGVEVGLQHF LDNGLGVRAQ YTRNWAKSWV GDQERPLEGI APSVYSLGVF
YDHGPVSLSL SGDHTAGFTT AVNVLGAGYN EKADAITWVT AHASYKINDK MDISLEGQNL
LDEANTYSIN GNSMLPQGYY RYGASYKLGL SYRF