Gene Caul_4602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4602 
Symbol 
ID5902064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4976284 
End bp4978911 
Gene Length2628 bp 
Protein Length875 aa 
Translation table11 
GC content66% 
IMG OID641565121 
ProductTonB-dependent receptor 
Protein accessionYP_001686220 
Protein GI167648557 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.683128 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.437508 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAGG TCTGGAAGAC CGTGCGGCAG GCCGCGCTGT CGGGTTCGTC GCTCATCGTA 
CTTTTCGCGG CCGGGCACGG CTCGGCGCAG ACGGCGAACA CGCCGCCCTC CGAACCGGCG
ACCGGGGCGT CGGAAAACGT GCTCGAAGAG CTCGTCGTCA CCGGATCGAG CATCCGGGGC
GTACCGCCAA TCGGCTCGAA TCTGATCAGC ATCTCACGCG ACGAGATCAA GGCGATCGGC
GCCGTCACGA CGCCAGATCT GCTGGCCACC GTGCCGCAGC TGAACAGCTT CAACACGGCG
CCGAGGGCCA ACAACGCCGG CGCGGGATCC TTCGCGCCCG GCATGCGCAG CCTGCCCGCG
ACGGCGACCC TGCCGCTCCT GAACGGTCAC CGGGTGGTGG CCGGAGGCGC GAACGAGACC
AACCCCGATT TCCCGTTCCT GCCGGATCTC GCGATCGAGC GCGTTGAGAT CGTCGCGGAC
GGCGCGTCGT CGATCTACGG CTCCGACGCC ATCGCCGGCG TGGTGAACTT CATTACCCGC
AAGCGCTATT CCGGCCTCCA AACGTCCGTG CGCTATGGCG CGGCCGACGA CTATCATACG
TTCAGCGTCA GCGGCCTGGC TGGCCGTGAC TGGGGTAGCG GGTCAGTTCT GGCCGCTTAT
CAGTATGCGG AAAGCGGCAA CGTCACGGGC GCCGATCGGG ACTATCGCAG CGTGGACTTC
CGTCCGTCCG GCGGGGTCGA CACCCGGGGG ACAAGCTGTC CTTCGCCGAA TGTGGTCGTG
GGCGCCACGG TCTATGCGGC GCCCGCCCTG GCGCCGAATA CGACCAACCA TTGCGATGTT
GGAGCCCCCG TCGACCTGGT CCCGGCCTCA CGTCTGCACA GCGCCTTTCT CACCGCTCGG
CAGGAAGCGG GCGATCACGT AACCCTTTGG GGCGACCTGC TGTACTCGGA CCGCAAGGAT
ACGGTCCGGG CGGCGATCCC CGCTCAGGCG GTGACGCTCA CGGCGGCCAA TCCGTTCTTC
AGGGCGCCGC CCGGCGTCGC GACGACTTTC GAGACCGTCT ACTTCCGTCC CGACAATCTG
GTTGGCGCGG ACCATTTCGA GCAAACCTAC CACGTGAAGG CGGGCAACGC GTCGGCCGGC
GCCGATGTGC GGCTGCCCCG CGACTTCAAC CTCAGCGCCT ACGGAACCTA TGACTGGGCG
ACCCATGTCG CCTTCCAGCC CATGATCAAT CCCGTGGCGC TGGCCGTGGC GGCGGCCGGC
GCCACCCCGG CGACGGCTCT GGATCCGTTC GGTCAAGGCA CGGCCCCGTC GGTTGCCGCG
GGCATCACCA ACTATTCGAC CAGCGTCACC ATCAAGCAAC GCACCAAGCT CGGCGCGATG
AAGATCGACG GGCCGCTGTT CGATCTCCCC GGCGGGGAGG TCAAGATCGC GGCAGGCGCC
GAATATCGGC GCGAGACGTT CGCTCAGCGG GGGTTCGTCG GAGCGACGCC CGTACCGGAA
AACCTGGGTC GCAACATCAA GTCCGTCTAC GGCGAGCTGT TCGTGCCGAT CTTCGGCGCC
GGTAACGAAA CGCGTTTCAT GCACCGGCTG TCGCTGTCCC TTTCGGGTCG CTACGATCAC
TACAGCGACT TCGGATCGAC GACCAATCCA AAGGTCGGCG TCAACTGGGA TCCGGTGGAA
GGCGTCACGT TCCGCGGGAC CTACGGCCGC TCCTTCCGGG CGCCTGGCCT GCGTGACGTC
GGCGCGACGG TCGGCGCATA TTACCTCGAC GCCGCGACAG CCGGCGTCGC CGCCCGCGAC
CCGACGCGCG GCGCCGCCCA GGTCAACACG GTCTATCTGC TCGGTGGAAA TCGCACCCTG
CAGCCTGAAA AGGCGAGAAC CTATTCGTTC GGCGTCGATG TGGCCCCGCC CTTCCTGCCG
GGCTTCCGGG CGAGCGCGAC GTTCTACGAT ATCAACTTCA CCGATGTGAT CGGAACGCCG
CCGACAAGCC TCGTCTTCAC CGACCCCACG TTCGCCTCGG TCGTCTATCG CAATCCTTCC
GCCACCCAGC TCGCCAGCTT GCTGGCGCTC GCCGTGCCGG TCAATCTGCC GTCGCCCTTG
CCCACGATCG GAAACGCGCT GGATCTGCGT CGCGGCAATT TCGGGGTTCG CAAGACCAGC
GGCCTGGACT TTGACGTCAA CTATCGGCGC CCGACCGGCT TTGGCTCCGT GTTCGTCGGC
GTCGCCGGGA ACTACATCTA CAAGTTCGAC ACCCAGCTTT CCCCCACGGC CCCCGTGTCG
AATGCATTGC GGCTGGGCGT GCCGCGCGCG ACCCTCAGGA CAACGGCCGG CTTCACGACG
GGTCCGGTCA GCGTGGCGAG CTACGTGAAC TATCGGGACG GCGTCACGAA CGCCTTCAAC
ACGCCCACCG GTGTCGGCGC GTACGAGGCC GATCCCTATA CGACGGTCGA TCTGCGGGTC
GCGGTGACCT TGCCGAACAC CGGGCTGACG TCGGGAACGG AACTGGCCTT GCAGGTCAAC
GATCTCTTCG ACAAGACGCC GCCGTTCTTC CCGGCGACGG ACGGGATCGG CGGCAACTAC
AATCCAATCG GGCGCTTCGT GGCCCTCAAT CTGCGGAAGA CCTTCTGA
 
Protein sequence
MAKVWKTVRQ AALSGSSLIV LFAAGHGSAQ TANTPPSEPA TGASENVLEE LVVTGSSIRG 
VPPIGSNLIS ISRDEIKAIG AVTTPDLLAT VPQLNSFNTA PRANNAGAGS FAPGMRSLPA
TATLPLLNGH RVVAGGANET NPDFPFLPDL AIERVEIVAD GASSIYGSDA IAGVVNFITR
KRYSGLQTSV RYGAADDYHT FSVSGLAGRD WGSGSVLAAY QYAESGNVTG ADRDYRSVDF
RPSGGVDTRG TSCPSPNVVV GATVYAAPAL APNTTNHCDV GAPVDLVPAS RLHSAFLTAR
QEAGDHVTLW GDLLYSDRKD TVRAAIPAQA VTLTAANPFF RAPPGVATTF ETVYFRPDNL
VGADHFEQTY HVKAGNASAG ADVRLPRDFN LSAYGTYDWA THVAFQPMIN PVALAVAAAG
ATPATALDPF GQGTAPSVAA GITNYSTSVT IKQRTKLGAM KIDGPLFDLP GGEVKIAAGA
EYRRETFAQR GFVGATPVPE NLGRNIKSVY GELFVPIFGA GNETRFMHRL SLSLSGRYDH
YSDFGSTTNP KVGVNWDPVE GVTFRGTYGR SFRAPGLRDV GATVGAYYLD AATAGVAARD
PTRGAAQVNT VYLLGGNRTL QPEKARTYSF GVDVAPPFLP GFRASATFYD INFTDVIGTP
PTSLVFTDPT FASVVYRNPS ATQLASLLAL AVPVNLPSPL PTIGNALDLR RGNFGVRKTS
GLDFDVNYRR PTGFGSVFVG VAGNYIYKFD TQLSPTAPVS NALRLGVPRA TLRTTAGFTT
GPVSVASYVN YRDGVTNAFN TPTGVGAYEA DPYTTVDLRV AVTLPNTGLT SGTELALQVN
DLFDKTPPFF PATDGIGGNY NPIGRFVALN LRKTF