Gene Caul_2859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2859 
Symbol 
ID5900314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3102719 
End bp3105427 
Gene Length2709 bp 
Protein Length902 aa 
Translation table11 
GC content67% 
IMG OID641563356 
ProductTonB-dependent receptor 
Protein accessionYP_001684484 
Protein GI167646821 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.115194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0892472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTATCA CCTTGAACAA CAGATCGCGT CTGGCGCTGT CCGGCGCGTC GTTTCTAGCC 
TTGGCGCTGT TGGCCTCGGG GCCGGCCTGG GCCCAAACGT CGGACACGGC CGGATCGAGC
GAAGCGCCCA GGCAGCGGTC GGACGCCGAT CGCGCCGCGT CGAGCGTTCC GGAGCAGGTC
TCCGAAGTCG TCGTGACCGG ATCGAGTATC CGCGGCGTCC CGCCAACGGG CTCGAACCTG
ATCAGCGTCT CGCGTGAAGA CATCAGGACC ATCGGCGCGA ACACCACGCC GGACTTGCTG
GCCAGCGTGC CGCAGTTGAA CAGCTTCAAC ACCGCGCCGC GGGCCGCCAA TGGCGGGGCT
GGCGCGTTCG CGCCCGGCCT GCGCAGCCTG CCGGCCAGCG CGACGCTGCC GCTCATGAAC
GGCCACCGCT TGGTGGCTGG GGGCACGAAC CAGACTAATC CTGACTTCCC GTTCCTTCCC
GAACTCGCGA TCGAGCGCGT CGAGATCGTC GCGGACGGCG CCTCGGCGAT CTATGGTTCG
GACGCGGTCG CGGGCGTGGT CAACTTCATC ACCCGGCAGC GCTATTCAGG CGTCGATACG
TCCATCCGCT ACGGGACGGC CGACGACTAT CATACGTTCA GCGCGAGCGG CCTGGTTGGC
CGCAGCTGGG ATCGCGGATC CGTCGTGGCC GCCTACCAGT ATGCGGAGAA CGGCGACATC
CTGGGGGGCG ACCGCGACTA CCGGATCGTC GACCTGCGTC CCTATGGCGG CGTCGACACG
CGCACGACGG TTTGCCCGTC GCCCAACGTG CTGGTCAACA CCACGGCCTA CGCCGTCAAC
TACGCTGCGC CCGCCCTGGC TCCGAACACC ACGAACTCTT GCGACAACGG GGCTGTGACC
TCCTTGGTCC CCAAATCGCG CCTGCACAGC GGGTATCTCA CGGCCCGGCA GGACCTGAGC
GACCGCGTCA CCCTGTGGGG CGAGCTTCTC TATTCGGATC GCAAGGACAC GGTTCAGGCG
CCGCCGCCGC CCGGAGCCGG CGCGTCGGGC GGGGTGATCC TGTACAACCT CCCCGGCTTC
GCCAACCCCT TCTTCAAGAC GCCGCCGGGC TCCGGCGCCA CGACGGAGTT CGTCCAGTTC
CGCACCGACA ACCTTTATGG CGCGGACCAT ATAAATAACG TCTTCCGCGT GCGTGCAGGC
AACTCGTCGG CGGGCCTAGA CGCCAAGCTC CCTCGCGACC TGAAGCTGAC GGTCTCGGGG
ACCTATGATT GGGCCACGAA CGACACCTAC CTTCCCGCGA TCAACCCGGC CGCGCTCGAC
GCGGCGGCGC TGGGAACCAC GACGGCCACG GCGCTCGATC CGTTCGGCAA TGGCACGTCA
CCCGCGGTCG CGGCCAAGAT CACCGACTAC GCCGGCGACC TTGTCATCCA CCAGCGCACC
TATCTGGGCG CGGCCAAGAT CGACGGGCCG CTGGCCGACC TGCCGGGCGG GCAGCTGAAG
GTCGCGGTCG GCGCCGAGTA TCGGCACGAG ACCTTCCAGC AGCGGGGCAT CTACGGCGGG
TCCCAGGTGC CCGAAGACCT GACCCGCAAC ATCGGGTCGA TCTACGGCGA GCTGTTCGTG
CCCATTGTCG GCGACGGCAA CCCGGCGCCG TTGATCCGCA GCCTGGCCCT GTCGCTCTCC
GGTCGCTACG ACCACTACAG CGACTTCGGA TCGACGACGA ACCCGAAGGT CGGGGTCAAC
TGGGATCCAG TGCGCGGCCT GACGCTTCGC GGCACCTACG GCCGCTCGTT CCGGGCGCCG
GGCCTGCGCG ACGTCGGAGC CACCGTCGGC GCCTACTATT ACAACGCCGC AGCCATCGCC
GGCAGCACCT TCCGCGATCC CACGCGGGGC GCCGCCCAGG TCGACACCAT TTTCCTGCTC
GGGGGCAACC GCAACCTCCA GCCGGAAGAG GCCCGAACCT ATTCGATCGG CGCCGACCTA
CACCCGGACT TCCTGCCCAA CTTCCATGGC AGCGTGACGT TCTACGATAT CCGATACACC
AACGTGATCG GAACGCCCGG CGGCGCCATC GCCTTCACCG ATCCCACCTT CTCCTCGGTG
ATCTATCGCA ACCCCACCGC GGCGCAGCTC AACAGCCTGC TGGGGATCGC GGTGCCGGTC
AACTTCGTGC CGGGGGCCTT GCCAACGATC GGAAACTTGC TCGATTTCCG GCAGGGCAAT
TTCGGCGTCC GCAAGACCAA CGGCCTGGAT TTCGACCTCG GCTATCGCCA GCCGACGAGC
TTCGGCGCGC TGTATGCCGG CCTGGCCGGC AACTACATCT TCAAGTTCGA AACCCAGCTC
TCGCCGACCG CGCCGGTCTC CGACAGCCTG AAGCTCGGCA TACCCACGAC GACCCTGAGA
GGAACGCTCG GCGCCCAGGC AGGCCTCTTC AACGCGGTGG CGTTCGTCAA CTATCGCGGC
GGCGTGACAG GTCTGTACGC CACGCCCACC AGCGCCGCCG AATATAGCGC CAAGGCCTAT
ACGACCGTCG ATCTGCGCTT CTCGGTGAAG CTGCCCTACG GCGGTCTGGC CAAGAGGACC
GAGCTGGGCC TGCAGGTGAA CGACCTGTTC GATAAGGATC CGCCCTTCTT CCCGCAGGCC
GAAGGAATCG GCGGCGCCTA CAACCCGATC GGCCGCTACT TGGCGCTGAA CCTTCGGAAG
AGCTTTTGA
 
Protein sequence
MSITLNNRSR LALSGASFLA LALLASGPAW AQTSDTAGSS EAPRQRSDAD RAASSVPEQV 
SEVVVTGSSI RGVPPTGSNL ISVSREDIRT IGANTTPDLL ASVPQLNSFN TAPRAANGGA
GAFAPGLRSL PASATLPLMN GHRLVAGGTN QTNPDFPFLP ELAIERVEIV ADGASAIYGS
DAVAGVVNFI TRQRYSGVDT SIRYGTADDY HTFSASGLVG RSWDRGSVVA AYQYAENGDI
LGGDRDYRIV DLRPYGGVDT RTTVCPSPNV LVNTTAYAVN YAAPALAPNT TNSCDNGAVT
SLVPKSRLHS GYLTARQDLS DRVTLWGELL YSDRKDTVQA PPPPGAGASG GVILYNLPGF
ANPFFKTPPG SGATTEFVQF RTDNLYGADH INNVFRVRAG NSSAGLDAKL PRDLKLTVSG
TYDWATNDTY LPAINPAALD AAALGTTTAT ALDPFGNGTS PAVAAKITDY AGDLVIHQRT
YLGAAKIDGP LADLPGGQLK VAVGAEYRHE TFQQRGIYGG SQVPEDLTRN IGSIYGELFV
PIVGDGNPAP LIRSLALSLS GRYDHYSDFG STTNPKVGVN WDPVRGLTLR GTYGRSFRAP
GLRDVGATVG AYYYNAAAIA GSTFRDPTRG AAQVDTIFLL GGNRNLQPEE ARTYSIGADL
HPDFLPNFHG SVTFYDIRYT NVIGTPGGAI AFTDPTFSSV IYRNPTAAQL NSLLGIAVPV
NFVPGALPTI GNLLDFRQGN FGVRKTNGLD FDLGYRQPTS FGALYAGLAG NYIFKFETQL
SPTAPVSDSL KLGIPTTTLR GTLGAQAGLF NAVAFVNYRG GVTGLYATPT SAAEYSAKAY
TTVDLRFSVK LPYGGLAKRT ELGLQVNDLF DKDPPFFPQA EGIGGAYNPI GRYLALNLRK
SF