Gene Caul_2073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2073 
Symbol 
ID5899528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2217229 
End bp2220231 
Gene Length3003 bp 
Protein Length1000 aa 
Translation table11 
GC content66% 
IMG OID641562562 
ProductTonB-dependent receptor 
Protein accessionYP_001683699 
Protein GI167646036 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.168139 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.78272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCTT TAGGTCGGAC GCGACGGGGA TCGCCCTTTT TGAACGTGTT GTTCGCGACG 
AGCGCGGTCG CGATGATCGT ATCGGCTGGA TCGGCCGTCG CGCAGACGGC GCCGCCGCCC
GCGGAGCCGG ACGCCACGGC CTTGACCGAG ATCGTGGTCA CGGGGACCCG CATCCGGTCC
ACCGGCTTCA CCGCGCCCAC GCCGACTCAG GTCCTGGGCC AGGCCGACCT GGAGCGCAAC
GCCGAGCCGA ACGTCTTCAC GACCATCGCC CAGCTGCCTT CCTTGCAAGG GTCAACGGGG
GCGACCACGG GCACGTTCAG CACGTCGAGC GGCCAGCAGG GCCTGAGCTC GTTCTCGCTG
CGAGGCCTGG GCACGATCCG GACGCTGACC CTGCTGGACG GCCAACGGGT GGTTCCGGCC
AACATCACGG GTGTTCCTGA CATCAGCCTG TTCCCGCAAT TGCTGGTCGA GCGGGTCGAT
GTCGTGACTG GCGGCGCATC CGCCTCCTAC GGCTCCGACG CGGTCGGCGG CGTCGTCAAC
TTCATCACCA ACAAGCGCTT CGAAGGCTTC AAGGCCAATG TGTCGGCCGG GGTCACGACC
TATGGCGACA ACGAGCAGTT CCTGCTGCAG GTCGCGGGCG GCAAGGCCTT CATGAACGAT
CGCCTGCATG TCCAGGTCAG CGGCGAATAC GACGAGGAGC AGGGCGTTCC CGCCGGCGGC
TTCGGTGAGG ACGCGCCTGG CAGCCGTGAC TGGTACACGA CCGCCACCCT GGTCAATCGC
GGCGTCACCA ACGACGGCTC GCCGCAATAC CTCTACCGCG AGCACGCCCA AGCTTATCAG
TACACCAAGT ACGGCCTGAT CAGCGCTGGT CCGCTGCAAG GCACCGCCTT CGACCAGGCC
GGCAACCCCT TCCAGTTCCA GTACGGCTCC AATGGCGTGC CGTCCAAGGC CGCCAACGGC
GCGGTGATCG GTTGCTACTC GAACGGCGGC TTCTGCGTCG GCGGCGACCT GTCGGGCAAT
GTCGGCGTCG GCACGTCGCT GCAATCCGAA ATCACGCGGA TGAACAGCTA CGGCCGCGTC
GCCTACGACA TCGACGACAA CAACGAGATC TACGGCACGC TGAACATCGC CCGCGTCGAG
TCCGCCAACC AGCCCAATCC TGGCGGCGCC ACGACCGGCC TGACGATGCA GTGCTCCAAT
CCCTATCTGC CGGCGTCCGT CGTGGCCGCC TGCGCCGCCA ACAACATCAC CAGCTTCACC
TTCGGCACCA GCAACGCCCT GCTGCCCAAC AATATCTCGG TCCATCCCAC GCGGACGCAA
TATCGCGGCG TGATCGGCGC AGACGGCAAG TTCGCCGCCT TCGGCACGGA TTGGCACTAC
GACGCCTATT ACACGCACGG CGAGAACACC ACGAACATCC ACGTCAACAA CATCATGCTG
ACCCCGCGCT ACAGGGCGGC CATCCAGGCC ACCCTGGTCA ATGGGCAGAT CGTCTGCGCC
GACCCCGTGG CGCGGGCTAA TGGGTGCCAG CCGATCAACG TCTTCGGCGG GCAGCGACCG
AGCGACGCGG CGCTGAAGTA CATCGAGCCC GAGAACGGTC CGTATCAGCA TTCGGTGCAG
AAGCAGGACG TCGCCAGCAT CAACTTCAGC GGCGAGCCCA TCCAGGGCTG GGCCGGCCCG
GTCGCCTTGG CCTTCGGCGC GGAATATCGC CGCGAGGCCT ACCACGTCCA GGGCGACTCC
TACGGCGACG GCTCGGCGGC CTCGCCCTAC ACCACCGACT ATCCGGCCGA CCCGTTGCTC
AACACGGCGG GCAACAACTG GTACGCGGGC AACTATCACT CGGGCGCCGG CAAATACACC
GTCAAGGAGG CCTATGCGGA AGTGAACCTG CCGCTGGTCA ATTCGGACGC CGCCGGCAAG
GCCAACCTGA ACTTCGCCGG TCGCTGGACC GACTACAGCA CCTCGGGCAC CGTCTATACC
TGGAAGGTCG GCGGAACCTG GGACACGCCG ATCGATGGCG TCCGCCTGCG GGCGGTGACC
TCGCGCGACG TGCGCGCGCC GAACCTGTCG GAACTCTACG CCGCGCCGGT CACCACCACC
CTGCCCAACT TCACCATTCC TTCGAACGGC ACGCCCCCGC CGGGCCAACC CTCGGGTCCC
GGTGCGCTGC TGGTCCTCCA GAACGTGGTC GGCAACCCGG ACTTGAAGCC CGAGATCGCC
AAGAACACGA GCCTCGGCGT GGTGCTGGCG AACCCGAAAT GGCTGCCGGG CTTCAGCGTC
TCCGTGGACT GGTACAACAT CGTGCTGAAT GGCGGTATTT CCAGCCTCAG CGCTCAGCAG
GTGGTCAACT TCTGCTATTC GGGCCTGACG CAGTATTGCG GCAGCTTCAA CTTCGCCCCG
CCGGCCGGCA CGACCCCCTA CGTCAACGCC CAGGTGTTCA ACCTGGCCTC GATCAAGACC
AGCGGCTTCG ACATCGAGAC CAGCTACCGG TTCGAGATTC CCAGCGTTCC GGGCCGCTTC
GACCTGCGCG CCCTGGCGAC CAACACCCAC AAGTTCGTCA CCAATCAGGG CTTCCCCGGC
GGCTCCGTTG TCGATACCGC GGGCCAGAAT TCCGGGGCCA CGCCAGACTG GAAGGTGCTC
GCCGTCCAGA GCTGGAGCTC GGACCGGTTC ATGCTCAGCC TGCAGGAACG CTGGTTCAGC
GATGGCGTCA TCGGCGGCCA GTACATCGAG TGCGCCGCCG GCAGCTGCCC GGTGGGTAGA
ACCGCAGCCG ATAACAACAA CTACCCGACC ATCGACCACA ACCAGATGAA GGGCGCCACC
TATGTCGACG TGAGCGGCTC CTACAAGGTG ACCAAGAGCC TCCAGGCCTA TTTCAAGGTC
GCCAACCTCT TCAACAAGGA CCCGACGCCG TCGCCGCAAA CCAACACCGG TCTTGACGCC
AATCCGGCGC TCTACGACCT GCTGGGGCGG TTCTACCACG TGGGCCTGCG CTACAGCTTC
TAG
 
Protein sequence
MSALGRTRRG SPFLNVLFAT SAVAMIVSAG SAVAQTAPPP AEPDATALTE IVVTGTRIRS 
TGFTAPTPTQ VLGQADLERN AEPNVFTTIA QLPSLQGSTG ATTGTFSTSS GQQGLSSFSL
RGLGTIRTLT LLDGQRVVPA NITGVPDISL FPQLLVERVD VVTGGASASY GSDAVGGVVN
FITNKRFEGF KANVSAGVTT YGDNEQFLLQ VAGGKAFMND RLHVQVSGEY DEEQGVPAGG
FGEDAPGSRD WYTTATLVNR GVTNDGSPQY LYREHAQAYQ YTKYGLISAG PLQGTAFDQA
GNPFQFQYGS NGVPSKAANG AVIGCYSNGG FCVGGDLSGN VGVGTSLQSE ITRMNSYGRV
AYDIDDNNEI YGTLNIARVE SANQPNPGGA TTGLTMQCSN PYLPASVVAA CAANNITSFT
FGTSNALLPN NISVHPTRTQ YRGVIGADGK FAAFGTDWHY DAYYTHGENT TNIHVNNIML
TPRYRAAIQA TLVNGQIVCA DPVARANGCQ PINVFGGQRP SDAALKYIEP ENGPYQHSVQ
KQDVASINFS GEPIQGWAGP VALAFGAEYR REAYHVQGDS YGDGSAASPY TTDYPADPLL
NTAGNNWYAG NYHSGAGKYT VKEAYAEVNL PLVNSDAAGK ANLNFAGRWT DYSTSGTVYT
WKVGGTWDTP IDGVRLRAVT SRDVRAPNLS ELYAAPVTTT LPNFTIPSNG TPPPGQPSGP
GALLVLQNVV GNPDLKPEIA KNTSLGVVLA NPKWLPGFSV SVDWYNIVLN GGISSLSAQQ
VVNFCYSGLT QYCGSFNFAP PAGTTPYVNA QVFNLASIKT SGFDIETSYR FEIPSVPGRF
DLRALATNTH KFVTNQGFPG GSVVDTAGQN SGATPDWKVL AVQSWSSDRF MLSLQERWFS
DGVIGGQYIE CAAGSCPVGR TAADNNNYPT IDHNQMKGAT YVDVSGSYKV TKSLQAYFKV
ANLFNKDPTP SPQTNTGLDA NPALYDLLGR FYHVGLRYSF