Gene Caul_3783 details


Gene Information

Locus tag: Caul_3783
Symbol:
ID: 5901245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4100419
End bp: 4103382
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 66%
IMG OID: 641564306
Product: TonB-dependent receptor plug
Protein accession: YP_001685408
Protein GI: 167647745
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID:
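
The coordinates and lengths listed above are related by simple arithmetic, which the following minimal sketch checks. It assumes inclusive, 1-based start and end coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature with inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 4100419, 4103382
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2964
print(protein_length_aa(length))  # 987
```

Both results agree with the Gene Length and Protein Length fields of this record.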


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00272324
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCTCAAGG CTGCAAGCTT CGTTTCAACG TCGCTGATCG CCCTGGCGGC GGCGACATCC 
GTCTCGGCGC AAACCGTGCA GACGGATAGC GGCGCATCGA CCAAGGCGGC GGAGTCTGTC
TCGGTGGAAG AGGTGGTCGT CACCGGTTCA CGGGTGCGCA CGACCTACAA TTCGCCCACG
CCGGTCAATG TGGTCGGCCA GGAGCGCATG CAGCAACTGG CCATCCCCGA CGTGGCCACC
GCGCTCAACC AGATTCCGTC ATTCCGCGCG ACCACCTCGG CCTCGACGAT CCTGTTCCGG
GTCTCGGGCG CGATCGGCGG CAACACGCCG GACCTTCGGG GCCTTGGCAC CAGCCGCACG
CTGGTGCTGG TCGATGGACG GCGCTTCGTT CCCAGTCTGG ATTCCGGCGG CGTCGACCTC
AACAGCGTGC CCAACGCGCT GGTCAAGCGC ACCGAGATCG TCACCGGCGG GGCCTCGGCC
GCGTACGGCG CGGACGCCGT GGCCGGGGTG GTCAACCTGA TCCTCGACAC CAAGTTCAAC
GGCGTCAGGA TCGACGCCAG CACCGGCGTC AGCGAACACG GCGACGGCAA GAACTACTTC
ATCTCGGCGT CGGGCGGGCG CGGCTTCGCC AGCGACCGCG GCCACATCAT CGCCGGGATC
GAATATCGCG ACGACAAGGG CGTCGGCAAT TGCTTCACCC GCGACTGGTG CGCCAAGCTG
ACGAACTTCG TTCCCAACCC CGGCTATATC GGCGGCGTGA GCACGAACGG CCTGCCGGCG
ACGCTCGTGC TCGACAACGT CAACTTCGTC TACAGTCCGA CCGGCGTCCT GCTGAGCGCG
GTGCAGACGG TCGGCGGCGT CAAGACGACG TTGGGTCAGC AGGTGGGCAA TACGGGCGCG
ACCGCGCTGC CCACCGCCCT GAGGGGGCTG CAGTTCAACG CCGCCGGTTC GGCGTTGACG
CCCTTCCAGT TCGGCAACTA CCTCAGCGGC ACGTTCATGC AAGGCGGCGA TCCAGCCGCC
AGCAACAACT GGGGCTGGGG CAACCCGCCG CTGGTGACGC CCACCTCGCA CGCCTCGGGC
CTGGTGCATG TCGACTATGA CCTGACGCCC AAGACCCAGG CCTTCGGCGA GTTCATCGTC
AGCCGCACGG AGGGCGGGCC GGTGCGCACC TCGGTGCTGC TGCAGGCGCC CGCCGGCGGC
TCGGCGGGGC TCGACATCAA CAATCCCTTC ATCACCCCGG CGGTGCGCGC GCAGATCCTT
GGCGCCAACC CCAACATCAC GGCGATCAAT GTCAACGTCG CCGTGGCGCA GGGCGGCGAC
ACAGTGGTGG CCTCCAGCAC CAACGACATC TACCGGTTCG TGACGGGGCT GAAAGGTGAT
CTCTTCGGCG ATTGGCGCTG GGACGCCTCG TACGAGTACG GCCGGACCAA CAGCGAGACG
ATCGTCAAGA ACACCCGTCT GGCCGCGTTC GACACCCAGG CCACCAATGC GATCACGCCG
CCGGCCGGCT ATACGGGGAC GATCTACACC ACGCCCGCCG GCGCGCCGGT GATCTGCGCC
TCTTCGGTGG CCAATCCGTC AGACGGCTGT CTTCCGGTCG ACCTGCTGGG CTCGAACATC
ACCCCCGCGG TGCTGAGCAA GTACTTCAAG GATGAGCGGC AGACCCGCAA GATCACCCAG
AACGACGTGA TGGTGAACTT CCGCGGCACG TTGTTCAGCC TTCCAGCCGG GCCGATCCAG
GCCGCGTTCG GCGCGGAATA CCGTCGCGAC AGCGTCTCCG GCGACGTCGA CGCCCTGACG
GCGGCCGGAC GGTTCGCCGC GCCCCAGGTG ACGGCCTTGC CGGAGGTCGT GCAGAAGGTG
ACGGAAGGCT ACGCGGAGGC GAACATCCCG CTGCTGGCCG ACCTGCCCTT CGCCAAGTCG
CTGTCGGTCG ATGTGACAGG ACGCCTGACG CACTACAGCG GCTTTGGCAG CGCCAGGCCC
TGGAAGATAG GGCTCGAATA CCAGCCCAAC GACCAGATCC TGGTCCGCGT GACGCGGTCG
GCCGACATCC GCGCGCCCAG CGCGGCGGAA TCGAACCCCA ACACGGTCCA GACCTTCCTG
CCCCTGAACG ATCCGTTCAG CGGCAGCAAC CACCTGATCG GCGCCCCGGC CGGCGGCAAT
CCCAACCTGG AGCTGGAATC GGCCAAGACC AACACCGCCG GGATCGTGCT GAAGCCGAAC
TTCCTTCCGG GCTTCCACGC CTCGGTCGAC TGGTATGACA TCACCGTAAA GAACGCGATC
GACGCGGTGA CGGCGCCCAA CATCCTGTCG GCCTGCGCCA CCAAGAACCT GCTGTGCAAC
CTGATCACCT TCAGCGGCGC GGCCAAGGCC AGCCCGGTGG TGTCGGTGCT CTCGAACTTC
CAGAACGTCG CCCAGGTTCA CGCCGAGGGT TACGAATTCC AGTCCGACTA CACGATCCCG
GACGTGTGGG ACGGCGCCGT CACCTTCCAG CTCAACGCCA ACTACGTCAA AGACCTGAAG
TCGATCGGCG GCACGGGCCT GGTCACCCGG ATGAACGGCG TCACCGGCAA CGCCGGCTCG
CTCGCCGGCA TCGCGGGGGT GCCGAAGTAC AAGATCGACG GCCTGGTCGC CTATACGCGG
CCCAGCTGGA TGGTCGCGGC CCACATGCGC TACATCCCGG AGAGCATCCT GGATCCGACC
AAGATCGGAC CCAAGCAGGC GGGCTACAAC ATCAACCTCC CGACCAGCAT CATGATCAAT
TCGGTCAGCT CGCGTTTCTA TCTGGACCTT TCCGGCTCGG CCCACCTGCC GTCGATCTTC
GGCAGCAGCA AGACGGAACT GTACGGCGGC GTCACCAACG TCTTCGACAA GGACCAGCCG
CCCGAGCTGC GCCTGTTCGG CAACCCTCTG CAGTACGACA CGGTTGGCCG CGCCTTCCGA
CTGGGCATCC GGGCCGCCTG GTAG
 
Protein sequence
MLKAASFVST SLIALAAATS VSAQTVQTDS GASTKAAESV SVEEVVVTGS RVRTTYNSPT 
PVNVVGQERM QQLAIPDVAT ALNQIPSFRA TTSASTILFR VSGAIGGNTP DLRGLGTSRT
LVLVDGRRFV PSLDSGGVDL NSVPNALVKR TEIVTGGASA AYGADAVAGV VNLILDTKFN
GVRIDASTGV SEHGDGKNYF ISASGGRGFA SDRGHIIAGI EYRDDKGVGN CFTRDWCAKL
TNFVPNPGYI GGVSTNGLPA TLVLDNVNFV YSPTGVLLSA VQTVGGVKTT LGQQVGNTGA
TALPTALRGL QFNAAGSALT PFQFGNYLSG TFMQGGDPAA SNNWGWGNPP LVTPTSHASG
LVHVDYDLTP KTQAFGEFIV SRTEGGPVRT SVLLQAPAGG SAGLDINNPF ITPAVRAQIL
GANPNITAIN VNVAVAQGGD TVVASSTNDI YRFVTGLKGD LFGDWRWDAS YEYGRTNSET
IVKNTRLAAF DTQATNAITP PAGYTGTIYT TPAGAPVICA SSVANPSDGC LPVDLLGSNI
TPAVLSKYFK DERQTRKITQ NDVMVNFRGT LFSLPAGPIQ AAFGAEYRRD SVSGDVDALT
AAGRFAAPQV TALPEVVQKV TEGYAEANIP LLADLPFAKS LSVDVTGRLT HYSGFGSARP
WKIGLEYQPN DQILVRVTRS ADIRAPSAAE SNPNTVQTFL PLNDPFSGSN HLIGAPAGGN
PNLELESAKT NTAGIVLKPN FLPGFHASVD WYDITVKNAI DAVTAPNILS ACATKNLLCN
LITFSGAAKA SPVVSVLSNF QNVAQVHAEG YEFQSDYTIP DVWDGAVTFQ LNANYVKDLK
SIGGTGLVTR MNGVTGNAGS LAGIAGVPKY KIDGLVAYTR PSWMVAAHMR YIPESILDPT
KIGPKQAGYN INLPTSIMIN SVSSRFYLDL SGSAHLPSIF GSSKTELYGG VTNVFDKDQP
PELRLFGNPL QYDTVGRAFR LGIRAAW
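
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may initiate translation. The start-codon set used here (ATG, GTG, TTG) is an illustrative subset chosen for this example, and `translate_cds` is a hypothetical helper, not a library function.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the alternative initiators allowed in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine regardless of its usual meaning.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence in this record.
first_line = "TTGCTCAAGG CTGCAAGCTT CGTTTCAACG TCGCTGATCG CCCTGGCGGC GGCGACATCC"
print(translate_cds(first_line))  # MLKAASFVSTSLIALAAATS
```

The output matches the first 20 residues of the protein sequence, including the leading M that comes from the TTG start codon.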