Gene Caul_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1941 
Symbol 
ID5899396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2081178 
End bp2083208 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content64% 
IMG OID641562431 
ProductTonB-dependent receptor 
Protein accessionYP_001683568 
Protein GI167645905 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.891675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTGAAA GTCACACGAT CACGACCGGC ACGGACCTGC AGAAACTGAT CCCCACCCTG 
AACGTCGGCG TCAGCATCTT TGGCGGCACG CAACAGTTCT CGCTTCGGGG CGTGCGCACC
GGCGTGGTCA GCTACCTGAA CGAAGTGCCG GTCGACGGCG TGCTCGCCGA CCAGATGCTG
TGGGACCTGT CGTCCATCCA AGCGATCTCG GGACCGCAAG GGACGCTGTT TGGCAAGAAC
AGCACCGGCG GCGCCGTCCT GTTCGTCCCC AATCAGCCCG GCGAAGAGTT CGAGGGTTAT
GTCGAAGGCC GCCTCGGCAG GTTCAATCTG CGCGAAGGCA CCGCGGTCGT GAACCTTCCG
GTCAATGACA AGCTGGCCCT ACGCATCGGC GCCCGCGTGA CGAAGCGCGA CGGGATCATC
GACAATCTGA CCGGCCCGGA CCTGCAGGCG CAGGACCACC GATCGCTGCG GGTGTCGGCG
CTGTTCAAGC CCAACGCCGT CCTGACCAAC TACACCACGT TCAACTACGC CCATCGGGAC
GACACGCCCT ACGCCCAGAT CAGCGGATCG GGCGCTGGCA CGCCCAGCTG CCCCACCGCC
CTGCCCGCCT GCGTCTATGG CGCAAGCTAC GCCAACGAGC TGGCCGCCCA GCGCGCCCGC
GGTATCCGGA CCGTGTCGAT CCCGCTGGAC GCCAGCCAGT CCGCCTCGCC CTGGCAGCTG
ACCAACGTGC TGAGCGGACA CTTTGGCGCG GTGACCGCCA AGTACATCTT CGGCTATCAG
AAGAACAAGG ACCGCCAGTT CACCAGCCAG CTTTCGATCC CGCTGCCGGT CATCATCGGC
CTGAACCAGA ACAGGACCAG CCTGAAGACG CGCGAGTTCC AGCTTCTGGG CAGTGCCTTT
ACCGAACGGC TCACCTGGGT CGCGGGGCTG TACGCGTCCG ACAGCGACGT GAACAACTTC
AACAGCTATC TGCTGTTCGC TCCGGTCGGC ACCCCGCACA ACAACAACAC CACCCAGCAG
ACCGGTGGCA ATACGACGAC GGATTCCAAG GCCGCCTACG CCCAGGGCAC CTTGGCCGTG
ACCGACCGGT TCAACGTGAC CGTGGGCGCC CGGTACACCC AGGATGACGT GAAGACGGCC
CAGTTCGGCT ACAGCCCCGG GCACGTCTGC AACCTTCCGG CCGCCCTGCC CAGCGTCAAC
ATCGCGACCT GCACCCAGCG GATCGCGGCC AAGACCGATG CGGTGACCTA CAACCTGTCG
GCGGATTTCA AGGTGTCCGA CGACGTCCTG CTCTATGCGA CCACGCGCAA GGGCTACAAC
GCCGGCGGCT TCAATCCGAA CATCAACGAC GCCGATCTGG AAGTCGTCAG GCCCGAGTAC
ATCACCGACT ACGAAGGGGG CCTGAAGGCC GACTGGAGCC TGGGCGGCAT GCCGGTCCGG
ACCAACATCT CGACCTTCTA CGCCAAGTAC AAGGACATCC AGCGCACCAC CTCGCTGGTG
TTCGACAACT TGATCGTCAC CGGCAATTTC AACGCCGCCA AGGCGACGAT CTACGGCGCC
CAGATCGAGA TCCTGGCCCG TCCGGTCGAG CCGCTGACGC TGCAAGCGTC CTATGGCTAT
CTGCACACCA AGTATGACAG CTTCCAGAAC GCCCTGCTGG GCGACGTCAC CGGCAACAGC
TTCGCCCAGG CGCCGGAGGA CACGCTCAAT GTCTCGGCGA CCTACCGCCA TGCCTTGCCG
TCCGGCGAAC TCGTCGCCAA CGTCAGCTAC GCCTATATCA GCAAGGTCGC CTACTCCGAC
GACAACCTGA CGACGCCCGG CAATATCGCG CCGGGCTACG GCCTGGTCGA TGCCCGACTG
GACTGGAAGA AGGTCGGCGG CAGCGCCGTC GACCTGGGCG TCTACGTCAA GAACGCGACG
GACAAGGAAT ACCTTCTCAA CACCACCGAC CGGACCGGCC GGTTCGGCTT CGACTCCCGG
GTCTATGGCG ACCCCCGGAC CTTCGGTGTC GAGATCCGCT ACTCATTCTA A
 
Protein sequence
MLESHTITTG TDLQKLIPTL NVGVSIFGGT QQFSLRGVRT GVVSYLNEVP VDGVLADQML 
WDLSSIQAIS GPQGTLFGKN STGGAVLFVP NQPGEEFEGY VEGRLGRFNL REGTAVVNLP
VNDKLALRIG ARVTKRDGII DNLTGPDLQA QDHRSLRVSA LFKPNAVLTN YTTFNYAHRD
DTPYAQISGS GAGTPSCPTA LPACVYGASY ANELAAQRAR GIRTVSIPLD ASQSASPWQL
TNVLSGHFGA VTAKYIFGYQ KNKDRQFTSQ LSIPLPVIIG LNQNRTSLKT REFQLLGSAF
TERLTWVAGL YASDSDVNNF NSYLLFAPVG TPHNNNTTQQ TGGNTTTDSK AAYAQGTLAV
TDRFNVTVGA RYTQDDVKTA QFGYSPGHVC NLPAALPSVN IATCTQRIAA KTDAVTYNLS
ADFKVSDDVL LYATTRKGYN AGGFNPNIND ADLEVVRPEY ITDYEGGLKA DWSLGGMPVR
TNISTFYAKY KDIQRTTSLV FDNLIVTGNF NAAKATIYGA QIEILARPVE PLTLQASYGY
LHTKYDSFQN ALLGDVTGNS FAQAPEDTLN VSATYRHALP SGELVANVSY AYISKVAYSD
DNLTTPGNIA PGYGLVDARL DWKKVGGSAV DLGVYVKNAT DKEYLLNTTD RTGRFGFDSR
VYGDPRTFGV EIRYSF