Gene Caul_0429 details

Gene Information

Locus tag: Caul_0429
Symbol:
ID: 5897703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 469757
End bp: 471973
Gene Length: 2217 bp
Protein Length: 738 aa
Translation table: 11
GC content: 64%
IMG OID: 641560915
Product: TonB-dependent receptor
Protein accession: YP_001682064
Protein GI: 167644401
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
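
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 469757
END_BP = 471973


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2217
print(protein_length(length_bp))  # 738
```

Both results match the Gene Length (2217 bp) and Protein Length (738 aa) fields.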


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGTCATC GCAAAGCTTG GCTGTTGTGC GCTTCGGCTT TCACCGTCCT GGGCGCGCAC 
AGCGCTTGGG CGCAGACCGC GCCGACGCCG GCCCAGGATT CGTCGACCAT CGAAGAGGTG
GTCGTCACCG CCGAGCGGGT CGCGGGCAAT GTCCAAACCA CGCCGATCTC GATCGCCGTC
TATTCGGGCC AGGCCCTGGA GGCTCGGGGC GTGGCCAGCA TTCGCGCGCT GAGCGTCATC
GACACCAGCA TCAACTTCTC CTCATCGGGC GGCCAGAACG TCGTTTCCAT CCGCGGCGTG
ACGAGCACCA ACGCCACCGA GACCGGCGAG CCGGCCGTCT CGGTGTCGAT GGACGGGGTG
TTCAACAACC GCGGCTATTC GCTCGGCGCC ACGATGTACG ACATCGCCCG GGTCGAGGTC
CTGCGCGGCC CGCAGGGCAC GCTGTTTGGC CGCAACACCA CCGGCGGCAT GCTCAACATC
ATCACCCAGC GCCCGGGCGG TGATTTCGCC GGCCGCGTGA CCGTCGACTT CGGCAATTAC
GACACCAAGA ACTTCGACGG CTTCCTGAAC GTGCCGATCA GCGACACCTT CAAGATGCGC
GCGTCCTTCT CGTCGCGCTA TCGAGACGGC TTCCGCAAGA ACACGCCGTT CGACCAGCGG
GCCGACGACG AGATCAACAA CTCCGGCCGC CTGCAGTTCG CCTGGGAGCC CACCGCGCGT
CTGCGCACCT GGCTGTCGCT GTCGGCCACC CATGAAGGCG GCATCGGCGG ATCAAGCGAG
TCGATCCCGT TCCGCTACGC GCCGGGCACC CCGCTAAACG CCACGGGTCA GCCCACCACG
CCCGGCACGC CGCCGCTTCA CACCATGCCG CCGCTGAGCG ACGGGGTGCA CTACACGATC
TATAGCAGCG CCGATCAGAA GATCGACACG CGCGACGCCA AGTGGAGCCT GGCCTACGAC
CTCAACGAGG CCGTGACACT CAGCTATCTG GGCGGTTACA ACACCATCGA CTTCATGAAG
GAGCTGCCGA CCTTCTTCCG CGGCAATCCT TCGCTGTTCA CCCGGCGCGA ACATCCCGAC
ACTTGGAATA ATGAAGTGCG GATCGCCTCG AACGCCGGTG GCCCGCTAAC CTGGCAAGCC
GGGGCCTTCG CCTATAGCGA AAAGTCGGGC CTGTTCACCG ACTTCATTCG CAATCCGGGC
GCGGCCACCG CCACCGAACT CTATCAATTC GATACGCCCC TGGTGAAGGC GACGTCCAAG
GCCCTGTTCG CCCAAGCCAA TTTCAACATC ACCGATGCGT TGAAGATCAC CGGCGGCGTG
CGTCACACCT GGGATGAGAA AACCCGTCGC GGCATCTTCC GCATTCGTCC GGCCTATACC
GGCGCGCCGG TGACCGTCAC CCAAACCCAG GACGGCGACG CCAAGTCCGA CAAGACCACA
TGGTTGGTCG GCCTCGACTA TCAAGTCACC GACCGCAACC TGCTCTACGG CAAGGTCAGC
ACCGGCTATA AGAGCGGCGG CTTCACCGGC GCCAGCCAGT ACAAGCCCGA GACCATGATC
TCCTACGAGG TAGGAACCAA GAACCGGTTC TTCGACAACT CCCTGCAGCT GAATCTGGCC
GCCTACCTGA TGAATTATGA AGACCAGCAG GTGAACCAGT ACCTGGCGGT TGGTGGCGGC
CCGGTGCAGG CGGTGACCAC CAACGCCGGC GAGTCGGAGA TCTACGGCCT GGAAACCAAC
GTCATCTCCA CCTCGGATCT TGGTCGGTTC GAGCTATCGG CCAACCTTTT GCACGCCCGC
TACAAGACCT TCGTGCTGGG CGCGGGCTGG TCCTCGCCTC CGGCCGTCAC CGTGCTGCTG
GACCTGGAAG GCAACCGCCT GCCGGTCAGC CCCGACCTTT CGGTCAGCCT GCAATGGGAA
AAGTCGTTCG CCCTCTTCGG CGGCGCGCTA ACCCCGCGCG CGGCGGTCAA GCATCAAACC
AAAATCTACT ACGCGCCCAA CAACTACGAG GACCAAAAGC AGGGGGCGTT CGAGACTGTG
GATGTCAGCG CCACCTGGGC GCCGGCCGCG GGCAACTGGT CGGTGCAAGC CTACGCCAAC
AACCTGTTCG ACGTCGATCG CATCAACTAC GCCGACGAGA ACTATAATTT CGGCGTCTAC
AACGTGGCCT ATGCGCCGCC GCGCACCTAC GGCGTGCGAA TCAGCGCCTC GTTCTAA
 
Protein sequence
MRHRKAWLLC ASAFTVLGAH SAWAQTAPTP AQDSSTIEEV VVTAERVAGN VQTTPISIAV 
YSGQALEARG VASIRALSVI DTSINFSSSG GQNVVSIRGV TSTNATETGE PAVSVSMDGV
FNNRGYSLGA TMYDIARVEV LRGPQGTLFG RNTTGGMLNI ITQRPGGDFA GRVTVDFGNY
DTKNFDGFLN VPISDTFKMR ASFSSRYRDG FRKNTPFDQR ADDEINNSGR LQFAWEPTAR
LRTWLSLSAT HEGGIGGSSE SIPFRYAPGT PLNATGQPTT PGTPPLHTMP PLSDGVHYTI
YSSADQKIDT RDAKWSLAYD LNEAVTLSYL GGYNTIDFMK ELPTFFRGNP SLFTRREHPD
TWNNEVRIAS NAGGPLTWQA GAFAYSEKSG LFTDFIRNPG AATATELYQF DTPLVKATSK
ALFAQANFNI TDALKITGGV RHTWDEKTRR GIFRIRPAYT GAPVTVTQTQ DGDAKSDKTT
WLVGLDYQVT DRNLLYGKVS TGYKSGGFTG ASQYKPETMI SYEVGTKNRF FDNSLQLNLA
AYLMNYEDQQ VNQYLAVGGG PVQAVTTNAG ESEIYGLETN VISTSDLGRF ELSANLLHAR
YKTFVLGAGW SSPPAVTVLL DLEGNRLPVS PDLSVSLQWE KSFALFGGAL TPRAAVKHQT
KIYYAPNNYE DQKQGAFETV DVSATWAPAA GNWSVQAYAN NLFDVDRINY ADENYNFGVY
NVAYAPPRTY GVRISASF
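
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes translation table 11 (as listed in the record), whose codon-to-residue assignments match the standard code, and it assumes the first codon is an initiator that is read as Met even when it is TTG, as it is here. The helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical, and this is not the pipeline that produced the record.

```python
# Codon table built from the conventional TCAG ordering (standard assignments,
# which translation table 11 shares for non-initiator codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-character blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the first codon is read as Met (assumption)."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 in length")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    residues[0] = "M"  # initiator codon (e.g. ATG, GTG, TTG) treated as Met
    protein = "".join(residues)
    # Drop a trailing stop codon so the result matches a protein record.
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 nt of the gene sequence above.
print(translate_cds("TTGCGTCATC GCAAAGCTTG GCTGTTGTGC"))  # MRHRKAWLLC
```

The ten residues printed agree with the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.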