Gene Caul_1970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1970 
Symbol 
ID5899425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2113133 
End bp2116063 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content67% 
IMG OID641562460 
ProductTonB-dependent receptor 
Protein accessionYP_001683597 
Protein GI167645934 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCTA GAATTTTCGC GCTCTTTGGC GCCGTGTCGA CGCTCGCCTT GAGTCTGGCT 
CACGCCGGGG TCGCCAACGC CCAGGCCTCA CGGCCAACGG CGGCCGCCGC CACCTCGGTC
GAAGAGGTCG TGGTCACCGG ATCGCGCATC GCGCGCCAGG ACTACACCGC CCAAAGCCCG
ATCGTCACGA TCGGCAAACA GATTCTCCAG GCCCAGGGGC CGACCTCGAT CGAGAACACC
CTCAACCAGT TGCCGCAGTT CGCCGCCACC TCCGGCTCGA CCTCGGCCAG TCAGGCCGGC
GGCGGACGGG CCAACGCCAA CCTGCGCGGC CTGGGCACGG CCCGCACCCT GGTGCTGCTG
GACGGCCGAC GGCTGCAGCC GTCCGACCCG CTGGGCGCCA TCGACCTCAA CACCCTGGCG
CCCAGCCTGA TCGACTCGGT CGAGGTGATC ACCGGCGGCG CCTCGGCGAT CTACGGCTCC
GACGCGATCG CCGGCGTCGT CAACTTCAAG CTCAAGCACG ATTTCGAGGG CCTGGAGATC
GACGCCCAAT ACGGCCAGAC CGAGCGCAGC GACGGCGAGA CCGCGGATCT GAACGCCACG
GTCGGCGGCG CCTTCGCCGA CGACAAGGGT CACGTGATGC TGTCGGCGTC CTATATGGAT
CGCCAGCGGG TCAACCGAAA TTCCCGCGCC TTCTTCAAGG ACGGCGGCAT CACCTCGGTG
CTGCCCAGCG GCCTGATCTA CGCCGACGCG GCCAACCTGC CGACCCAGGC GGCGATCAAC
GGCGTCTTCG CCAAATACGG CGTGACCATC AACGTGCCGC GCAACGCCAC CTACAGCCAG
AACCTCGACG GCACGCTGTT CACGATCTCC TCGCCGATCC TCAACTACCG CTTCCCTGCC
GACGGCCCCT ACACCATCAC CAGCAACGGC CAGGTCGCCG TGCCGCTGGG CGAGGCTGCG
CCCCTGCAGC AGCCTCTCAA GCGCCAAACC CTGTTTGGCC AGGTCAATTA CCAGCTCAAC
GACGCCGTCG AGGCCTACGG CCAGTTCAAC TACGCCCACT ACGTCTCCAA CCAGAGCGGC
TACGGCCGCA ACCAGGCGAT CACCCGCGAC GTCTATCTGC CGGTCACCAA CCCCTTCCTG
CCCGCCGACC TCAAGGCGAT CGCCGCCTCG CGGCCCAACC CGACCGCGCC GCTGCTGTTC
TACTTCAACA CCGGCCGCTA CTCGCCTGAC ATCGCCCAGC AGACCTATAA TGTCGCCCAG
GCCCTGGGCG GCCTGAATGG CCGGTTCGAC GCGATCGACG GGACCTGGGA CGTGTTCGCC
TCCTATGGCC GCACCGAGCA GAACGCGTTG TTGGGCGGCT ATATCGACCG GGCCTCGTTC
TTGTCGCTGG TCAACGCGGC CGATGGCGGC AAGAGCCTCT GCGAGGGCGG TCTGGACCCG
CTGGCCCTCA CGGCGCCCTC GAAATCGTGC CTGACCTATC TCCTGCGCGA GATGCACGAG
ACCACGACGC TGCAGCAGAC CAACGCCGAG GCCAATGTCC AGGGCCGGGT GCTGGCCCTG
CCGGCCGGCG ACCTGCGGTT CGCCGCGGGC CTCAGCTATC GCAAGAACAG CTACGCCTAT
TCGCCGGACG CCGCGCGCCT CACCGGGACG GTCCTGACCA CGGGTCTGAG CAAGGCTACG
GAAGGGGAGA CCAATTCGAA AGAGGTCTTC CTCGAGCTGG CGGTCCCGCT GCTGAAGGAT
CTGCCCTTCA TCCGCAAGCT CGACCTGGAT CTGGCCTATC GCTACTCCGA CTATGACACG
GTCGGCGGCG TTCACACCTA CAAGGCCAGC GGCGCCTGGG AGATCAACGA CGCGGTCAGC
CTGCGCGGTG GCTATCAGCA CGCGATCCGC GCGCCTTCGG TGGGCGAACT GTTCCGCCCC
TCCGAGCAGA GCGCGACCAC CGTCGGGCGG ATTTCCGCCG GCCTGGGCGA TCCCTGCGAC
ATCGCCAGCG CCTATCGCAC CGGCCCCAAC GCCGTCCAGG TGCGCGCTCT GTGCATCGCC
AACGGCATTC CGCTCAGCGT GATCGACGCC CACAAGTTCG CCGGCACCTC GGTGCAGTCG
GACGTTGCGG GCAATCCCGA CCTGAAGGAA GAGTCTTCTG ACACCTATAC GGCGGGCCTG
GTCTGGCGCT CGACCTTCCA GCAGCCCCTG CTGCGCCGGC TCTCGGCCTC GGTCGACTAT
TACGACATCA AGCTGACCGA CGCGATCGGC CTGATCACCG GCGACGTGAT CGCCCAACGC
TGCTTCAACG GTCAGGGTAA CGCCAACCCG ACCTACGATC CGGCCAACTA CTACTGCAAG
CTGATCCATC GCGGCTCTTC GGGCGGCTTC TCCAGCATCA CCACGCCGCT GCTCAACCTG
GCCGGCTACC GGACCTCGGG CGTCGACCTT CAGGTCGACT GGGGCGCGGA GCTGGCGGCG
TTCGGCGCTT CCGAGCAGGC AGGGTCGGTC AGCCTCAATG TGCTGATCTC CCATATGGAC
AAGTACGAGA TCCAGACCCT GGCCGGCGCG GCCTACGTCA ACTACGCCGG CACGATCGGC
AACGCCCAGA TCAGCGCCGA CGCCATCTCG CACCCCAAGT GGAAGGCCAT CACCAGCCTG
GGCTACCATG TGGGGCCCAT CGACCTGAGC CTGCGCTGGC GCTGGGTCGA GGCCATGGGC
AACGCCGCCA ATGTCGGCTC GGCCACCGCC ACCGCGCGGG GCGTCAAGGC GATGGACTAT
TTCGACCTGA CGGGACGCTA CCAGGTCAAC TCCACCGTCG AGCTAAGGGC CGGGGCGCTC
AATCTGGCCG ACCGCCAGCC CCCGGCCTGG ACCGGCGAGA GCGCCACCGA CACCGCCCTC
TACGACATGC TGGGCCGCCG CTACTTCCTG GGCGTCAACT TCAAGTTCTG A
 
Protein sequence
MQSRIFALFG AVSTLALSLA HAGVANAQAS RPTAAAATSV EEVVVTGSRI ARQDYTAQSP 
IVTIGKQILQ AQGPTSIENT LNQLPQFAAT SGSTSASQAG GGRANANLRG LGTARTLVLL
DGRRLQPSDP LGAIDLNTLA PSLIDSVEVI TGGASAIYGS DAIAGVVNFK LKHDFEGLEI
DAQYGQTERS DGETADLNAT VGGAFADDKG HVMLSASYMD RQRVNRNSRA FFKDGGITSV
LPSGLIYADA ANLPTQAAIN GVFAKYGVTI NVPRNATYSQ NLDGTLFTIS SPILNYRFPA
DGPYTITSNG QVAVPLGEAA PLQQPLKRQT LFGQVNYQLN DAVEAYGQFN YAHYVSNQSG
YGRNQAITRD VYLPVTNPFL PADLKAIAAS RPNPTAPLLF YFNTGRYSPD IAQQTYNVAQ
ALGGLNGRFD AIDGTWDVFA SYGRTEQNAL LGGYIDRASF LSLVNAADGG KSLCEGGLDP
LALTAPSKSC LTYLLREMHE TTTLQQTNAE ANVQGRVLAL PAGDLRFAAG LSYRKNSYAY
SPDAARLTGT VLTTGLSKAT EGETNSKEVF LELAVPLLKD LPFIRKLDLD LAYRYSDYDT
VGGVHTYKAS GAWEINDAVS LRGGYQHAIR APSVGELFRP SEQSATTVGR ISAGLGDPCD
IASAYRTGPN AVQVRALCIA NGIPLSVIDA HKFAGTSVQS DVAGNPDLKE ESSDTYTAGL
VWRSTFQQPL LRRLSASVDY YDIKLTDAIG LITGDVIAQR CFNGQGNANP TYDPANYYCK
LIHRGSSGGF SSITTPLLNL AGYRTSGVDL QVDWGAELAA FGASEQAGSV SLNVLISHMD
KYEIQTLAGA AYVNYAGTIG NAQISADAIS HPKWKAITSL GYHVGPIDLS LRWRWVEAMG
NAANVGSATA TARGVKAMDY FDLTGRYQVN STVELRAGAL NLADRQPPAW TGESATDTAL
YDMLGRRYFL GVNFKF