Gene Caul_1988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1988 
Symbol 
ID5899443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2136193 
End bp2139255 
Gene Length3063 bp 
Protein Length1020 aa 
Translation table11 
GC content70% 
IMG OID641562477 
ProductTonB-dependent receptor 
Protein accessionYP_001683614 
Protein GI167645951 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGTCA TGGTTGCAGC GTTTCGTCGG GGCCTCGTCG GCCTTTCCCT TCTGGCCCTG 
ACCACCGCGA TCGCCGGCCC GGTCCAGGCC GCCCAGGCGC CCTCGGCGCT GGTGCCCGTC
TCGGCGGTGC GAACGCCCAA GGGGCCGCTG AAGGCCGCCT TGCTCGCCCT GGGCCGCCAG
ACCGGCGTGC AGATCATCTT CACCAGCCGG GCGGTCGAAG GCCGCCAGGC CCCGGCCCTG
GACGGCCAAT TCAGCGTCGA CGAAGCGCTC GACCGCCTGC TGGCCGGCAG CGACCTGGAG
GCCCAGCGCG TGGGCCCGAC GGTCCTGGTC GTCCGGCCGC GCGCCGTCCT CGTTCCCGCC
TCGGCGACCT CGCCGACGGC GCCCGATCCG ATGGCCTCGC CGCCGACCGA TCCGATTGAG
ACGGCCCCGC CCGTCCAGGC CGTCAACGAC GCCACCATGC TGTCGGAGGT CGTGGTCGGC
AGCCACATCC GGGGCGCCCG TGACAGCGCC TCTCCGGTCG TCATCCTGGA CCGCGAGACT
CTGGACCGCG CCGGCCGCAC GAGCGTCGCG GAGGCCATGT CCAACCTGCC GCAAGCATTC
AACGGCTCGG GCAGCGAAGA CACCACCTCG ACCGGCGCCG ATCCTTTGGG CACCAATTCC
AGCCGCGGCG TCGGGGTCAA TCTGCGCGGC TTGGGAACCG ACGCGACCCT GGTGCTGGTC
AATGGCCGCC GCCTGGCAGG CACGGGATTG AAGGGCGATT TCGCCGACGT CTCGTCGATC
CCGATGGCGG CGGTGGACAG GATCGAGGTG CTGCTCGACG GGGCCTCGGC GCTCTATGGC
TCCGACGCGG TCGGCGGCGT GGTCAACATC GTGCTTCGCA AGCGCTACGA AGGCGCGGAA
ACCCGGGCTC TGGTCGGCGG CGCGACGCGC GGCGGGGCCA CGCAGTGGCA GTTTGGCCAG
ACGGTCGGCC ACGCCTGGGA CAGCGGAAAC CTGGTGGTCA GCTACGAGCA CAGCGCCCGC
GACCGACTGC GCGGCCGCGA CCGCGACTTC ACCGGCAACG CCGACCTGCG CGACCAGGGC
GGGACCGACC ACCGCCGCTA CTACAGTCAG CCCGGCAATA TCCTGCGCGC CAACGGCAGC
GGGGTCCTGG TCCCGACCTA CGCCATTCCT GGCGGACAGA ACGGCGTGGG CCTTTCGCCA
GCCAGCTTCG CGGCCGGCCA GACCAACCTG GAAAACCAGC AGCTGGCGTT CGACGTCCTG
CCGCGCCAGC GCCGCGACAG CGTCTACCTC GCGTTCGCCC AGGACCTGAC CCCCGCCATC
GAACTGTCGG CCGACGCCCG GGCCACCCGC CGTGACTTCA CCAGCCGGGG CGGCGCCTCG
ATCACCACCC TGACGGTCAA CGCCAGCAAT CCCTATTTCG CCTCTCCGAC CGGGGCGGCG
TCCGAGCGCA TCGCCTATTC GTTCCTCAAC GAGCTGGGCG GCCAACGGGT CAAGGGCGTG
GCCGACACCC TGGGCCTGTC GCTTGGCGGG ACGGCGCGCC TGCCGGCCGG CTGGCGGCTG
GAGACCTACG GGGCCTATGG TCTTGAGACG ATGCACTCCC TGACCAACAA CCTGGTCAAT
TCGTCCGCCC TGGCGGAGGC CCTCGGCGCC ACGCCCGACA ATCCCGCCAC CGGTTTCAGC
ACCGCCGCGT CCGGCTATTT CAATCCGTTC ATCGGCGCGG GCTCCAATCC ACGCTCGATC
CTGGATTTCA TCAACACGGC CTTCGTCGAT CGCAAGACCC GCAGCGACAC ACGCTCGATC
AACCTGAAGC TCGATGGGAC CTTGTGGTCC CTGCCCGCCG GACCGGTGGG ACTGGCGGTC
GGCGGCCAGA TCCGCCGCGA GGGGCTCAAG AGCGGCGGGC AGAGCCTGGC CTCGGGCGTT
TCGCCGATCC CGATCGCCCG AAAGGACACC GACCGCACGG TCGACGCGGC CTTCGCGGAG
GTCCGACTGC CATTGTTTGG CGGAGCGTTC ACGCGGCCGG GCCTGCGGCG TCTGGAACTG
TCGGCGGCGG TTCGTCACGA GGACTATGGC GGCGCGCTCA AGAGCACGGA CCCCAAGCTT
GGAGTGATCT GGTCGCCGGT GGCGGGCGCC ACCCTCAAGG CCTCCTACGG CACGTCGTTT
CGCGCGCCAG CCCTCACCGA GCTGAACGAT CCGCAGATCT TCGCCCCGAC CACGATCAAC
ACCAGCGGTC GCGAGACCAT CGTGATGATC CTCTATGGCG GCAATCCCAA TCTGAAGCCC
GAGACGGCGA CCTCCAAGAC CCTGACCCTG GAGCTCGCGC CGCCCGACTG GTCGCGATTC
AAGGCCTCGC TGACCCTGTT CGACACCCGG TTCACCGACC GGATCGGCCA GCCGGGCAAC
GAATATATCG ACAGGGTCCT GACCTCGGCG GAGTTCGCGC CGTTCGTCAC CCTGGTGTCG
CCCGCGACCA ACACCGCGGA CCGCGCGCGC ATCCAGGCCC TGATCGACGA CCCGCGCTCC
TACGCCCAGG GGGTCTTTCC GGCGGAAGCC TATGGCGCGA TCGTCGACGG CCGTTACGTC
AACACGGGTC AGCTTCGGGT GCGCGGTCTG GACGTCTCGG CCCAGTATCA GGCCAGGCTC
GGCGGCGATC CTCTGGTGCT GTCGGCGGAC CTGTCCTGGA TGATGGACTA CAGCCGCAAG
ATCACGCCCG GCACACCCAG CGTGGATCGC GCCGGCTTCG TGGGCGAGCC CGCCGACCTG
CGCGCGCGCT ATGCGGCCAG CTGGACGCAC GGGTCCCTGA CGACCACGGC TTCGATCAGT
CAGGTGGGGG ATCTATCGAC CGACGGCGGC GGCCGCATCA AGGGCTGGAC CACCGCCGAT
CTCAACCTGA GCTACCGTTT TGGCAACGGC AGGCTGGAGG GCTCGGGCCT GTCCCTGAAT
ATGCAGAACC TCTTCGACAG CGATCCGCCC TTCTACGACT CGCCGCTCGG GGTGGGCTAT
GACCCGGCCA ACGCCGACCC GCTGGGTCGC GTCGTGACGC TGCAGCTGAC CCGGACCTGG
TAG
 
Protein sequence
MVVMVAAFRR GLVGLSLLAL TTAIAGPVQA AQAPSALVPV SAVRTPKGPL KAALLALGRQ 
TGVQIIFTSR AVEGRQAPAL DGQFSVDEAL DRLLAGSDLE AQRVGPTVLV VRPRAVLVPA
SATSPTAPDP MASPPTDPIE TAPPVQAVND ATMLSEVVVG SHIRGARDSA SPVVILDRET
LDRAGRTSVA EAMSNLPQAF NGSGSEDTTS TGADPLGTNS SRGVGVNLRG LGTDATLVLV
NGRRLAGTGL KGDFADVSSI PMAAVDRIEV LLDGASALYG SDAVGGVVNI VLRKRYEGAE
TRALVGGATR GGATQWQFGQ TVGHAWDSGN LVVSYEHSAR DRLRGRDRDF TGNADLRDQG
GTDHRRYYSQ PGNILRANGS GVLVPTYAIP GGQNGVGLSP ASFAAGQTNL ENQQLAFDVL
PRQRRDSVYL AFAQDLTPAI ELSADARATR RDFTSRGGAS ITTLTVNASN PYFASPTGAA
SERIAYSFLN ELGGQRVKGV ADTLGLSLGG TARLPAGWRL ETYGAYGLET MHSLTNNLVN
SSALAEALGA TPDNPATGFS TAASGYFNPF IGAGSNPRSI LDFINTAFVD RKTRSDTRSI
NLKLDGTLWS LPAGPVGLAV GGQIRREGLK SGGQSLASGV SPIPIARKDT DRTVDAAFAE
VRLPLFGGAF TRPGLRRLEL SAAVRHEDYG GALKSTDPKL GVIWSPVAGA TLKASYGTSF
RAPALTELND PQIFAPTTIN TSGRETIVMI LYGGNPNLKP ETATSKTLTL ELAPPDWSRF
KASLTLFDTR FTDRIGQPGN EYIDRVLTSA EFAPFVTLVS PATNTADRAR IQALIDDPRS
YAQGVFPAEA YGAIVDGRYV NTGQLRVRGL DVSAQYQARL GGDPLVLSAD LSWMMDYSRK
ITPGTPSVDR AGFVGEPADL RARYAASWTH GSLTTTASIS QVGDLSTDGG GRIKGWTTAD
LNLSYRFGNG RLEGSGLSLN MQNLFDSDPP FYDSPLGVGY DPANADPLGR VVTLQLTRTW