Gene Caul_2355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2355 
Symbol 
ID5899810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2553791 
End bp2556484 
Gene Length2694 bp 
Protein Length897 aa 
Translation table11 
GC content66% 
IMG OID641562846 
ProductTonB-dependent receptor 
Protein accessionYP_001683980 
Protein GI167646317 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGA TCGACAAACG AACTCTGCTG GCGAGCGTCG CCGCCATGGC GACGCTGGTC 
GGCGCCGGAC AGGCTTTCGC CCAAGCGACC GAGACGGCCA CGGTCGAGGA GGTGATCGTC
ACCGGCAGCC GCATCAAGCG CGCCGACATC ACCGGGGTTG GGCCGGCGAC CGTGATTTCG
CAGGACCAGA TCGAGCGGAC GGGCCTGACC AATGTCGAGT CCCTGTTGCA GCGCCTGCCG
GCCTCGGCCG GCAACGCCGG CAACCAGACC AACGCCTATT GGACCGGCAA CGGCTACGGC
ACCGCCCAGG TCAATTTGCG CGGCCTGGGC ATCAACCGCA CCCTGACCCT GATCAACGGC
CGCCGGGTGG TGAACGGCGG CACCGGCGCC AACAGCGCGC CCGACCTGAA CATGATCCCG
ACAGCGATCA TCAGCCGCAT CGACGTGCTC AAGGACGGGG CCTCGGCGAT CTACGGCGCC
GACGCCGTGG CTGGCGTGGT CAACATCATC ACCCAGAACA ACTTCGAGGG CCTGAAGCTC
TCGGCCAAGT ACGGCGTCAC CGACGAGGGC GACGGCCAGG ACTATACGGT CGACGTGCTC
TGGGGCATGC GCAACGAACG CGGCGGGGTG ACGGCGGGGC TGACTTACCA GAAGACCGAG
GCGGTCAACC TGGCCAGCCG CGCGCCCTGC GGCCTGGGCG AGGTCGGCGG CCAACTGGTC
TGCATCTCCA GCGCCTCGAC GATCGGCGGC CGGGCCGTGC TGCCCAATGG CCAGCAGATC
AATTTCAACC AGACCCCGGG CGGCAACGGC AATTTCTACG AGCCCTACAG CGCCGCCAAG
CACAACTTCA ACTCCAACCC GTTCCTCAAC GCCGTCAGCC CGATCGAGCG GATCAGCACA
GCCTTCCTCG CTGACTACAA GCTGACCGAC ACCGTCACCC TGTTTGGCGA GCTGTTCTTC
ACCCACCGCG AAAGCCACCA GATCGCCACG CCGGGAACCC TGCGCAACCT GGCCATCTTG
GCCAGCAACC CGACCAATCC GACGGGCCAG AACATCACCT TGCTGCAGCG CCGCCTGGCC
GAACCTGGGC CGCGGATCTT CTCGCAGGAG ACCGACACCT ACCGCTTCGT CGGCGGCGCG
CGCGGTTCGC TGGCGGGTGA CTGGACCTGG GAGGCGGCGG CCAACTGGGG CCGCAACACC
GGCGTCGATG CGATGAGCAA CATCGCCAAT CTTGAGCGGG TGCGTAACTC GGTCAACACC
AGCGTCTGCA GCACGACGCC CGGCGCGGCG ATCCCCTGTA TCGACTACCT CGGCGCTGGA
GATGTGACGC CCGCGGCCCT GAAATACATC CTGTTCAATT CCCGCGACAC CGGCGGCAAC
GAACAAAAGA GCGTCACCGC CGATGTCACC GGCACGCTGC TCAGCCTGCC GGCCGGCCGC
CTGGCGGTGG CCGCCGGTGT CGTCTATCGC CAGGAAAAGG GCTGGCGTGA TCCCGACGCC
CTGACGGTGC TGGGCATCGC CAACACCAAC CAGCAGGACC CGATCTCGGG TTCGGTGACC
GCCAAGGAAG CCTATCTGGA ACTTTCGGCG CCCCTGCTGG CCAACCTGCC GATGGTCCAG
AGCCTGGATT TGGACGCGGC GGTCCGCTAC TCGGATTATG ACCTGTTCGG CTCGAACGAG
ACCTACAAGA TCGGCCTGAA CTGGGCCGTG ATCCCCAGCC TTCGCCTGCG CGCCACCTAT
GGGACGGGCT TCCGTATCCC CAGCGTGCCC GAGCTGTTTG GCGGGGTGGC CGAGGGCAAC
CTGACGACCA CCGATCCGTG CAGCCGCTAC GCCACCAGCG GCAATGCCAC CCTGATCGCC
AACTGCCAGG CCACTGGCGT TCCGGCCAAC TACGTCCAAC TGGGCAATAC GGTGCTGACG
ACCGTCGGCG GCAACCAGAA CCTGCAGCCG GAAACCGCCA AGACCCTGAC GCTCGGGGCG
GTGATCCAAC CGCCGATCAT CCCGGGCCTG TCGGTGACGG TCGACTACTT CGATATCGAG
ATCAAGGACG CCATCCGCTC GATCCCCGGC TCCACCAAGT TGGCCGTCTG CTATGCGACC
CCCAGCCTGG CTCACCCGTT CTGCGGCGCC TCCAACTTCA CGCGCAGTCC GCTGACCGGC
GAGATCACCT ATCTGTCCGC CCAGCCGACC AATGCCGGCC TGGAGAAGAT GAAGGGCGTC
GATGTCGGAG TTCGCTATGA CTTCGACATC CGGGGCCTGC GCGCCAGCCT CGACTGGAAC
ACCACCTATC TCGACAGCTA TGTGGTGACC CCGTTCCAGG GCGCCGATCC TATCGTCTTC
GACGGCCATA TCGGCGGCGG CAACGGCGGC TATCCGCACT GGCGCTCCAA TGGCAGCTTT
TCGCTGATGG CCGAACGCTG GACGGGGACC TATTCGGTGC AGTGGATCGG CAAGGCCACT
GACTTCAACG CCGCGCCCAC GGCGATCGGC TACAAAACGC CCGACGTCTT CTACCACAAC
GCCCAGTTCG CCTACCGGCT GGGGGGGCAG GCCGACATCG CCGTCGGCGT CGACAACCTC
TTCGACAAGA AGGCGCCGTT CATCCAGAGC TGGACCGACG GCAACACCGA CACCATGACC
TACGACCTGC TGGGTCGGCG GGGCTATGTG CGGCTGAGCT ACAAGTTCAA CTAG
 
Protein sequence
MKLIDKRTLL ASVAAMATLV GAGQAFAQAT ETATVEEVIV TGSRIKRADI TGVGPATVIS 
QDQIERTGLT NVESLLQRLP ASAGNAGNQT NAYWTGNGYG TAQVNLRGLG INRTLTLING
RRVVNGGTGA NSAPDLNMIP TAIISRIDVL KDGASAIYGA DAVAGVVNII TQNNFEGLKL
SAKYGVTDEG DGQDYTVDVL WGMRNERGGV TAGLTYQKTE AVNLASRAPC GLGEVGGQLV
CISSASTIGG RAVLPNGQQI NFNQTPGGNG NFYEPYSAAK HNFNSNPFLN AVSPIERIST
AFLADYKLTD TVTLFGELFF THRESHQIAT PGTLRNLAIL ASNPTNPTGQ NITLLQRRLA
EPGPRIFSQE TDTYRFVGGA RGSLAGDWTW EAAANWGRNT GVDAMSNIAN LERVRNSVNT
SVCSTTPGAA IPCIDYLGAG DVTPAALKYI LFNSRDTGGN EQKSVTADVT GTLLSLPAGR
LAVAAGVVYR QEKGWRDPDA LTVLGIANTN QQDPISGSVT AKEAYLELSA PLLANLPMVQ
SLDLDAAVRY SDYDLFGSNE TYKIGLNWAV IPSLRLRATY GTGFRIPSVP ELFGGVAEGN
LTTTDPCSRY ATSGNATLIA NCQATGVPAN YVQLGNTVLT TVGGNQNLQP ETAKTLTLGA
VIQPPIIPGL SVTVDYFDIE IKDAIRSIPG STKLAVCYAT PSLAHPFCGA SNFTRSPLTG
EITYLSAQPT NAGLEKMKGV DVGVRYDFDI RGLRASLDWN TTYLDSYVVT PFQGADPIVF
DGHIGGGNGG YPHWRSNGSF SLMAERWTGT YSVQWIGKAT DFNAAPTAIG YKTPDVFYHN
AQFAYRLGGQ ADIAVGVDNL FDKKAPFIQS WTDGNTDTMT YDLLGRRGYV RLSYKFN