Gene Caul_2647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2647 
Symbol 
ID5900102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2873929 
End bp2876472 
Gene Length2544 bp 
Protein Length847 aa 
Translation table11 
GC content68% 
IMG OID641563138 
ProductTonB-dependent receptor 
Protein accessionYP_001684272 
Protein GI167646609 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.492144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCTGG GCGCGGTCGC GGCGATGCTT TGCGCCACGA GCTGCCCGAT CGTCTGGCTG 
GGCGTCCCGG CGCAGGCCGA AGCCGCCAGC GTGGCGGCGG CGACCCGCTT CGACATTCCC
GCCCAGCCGC TCAGCAAGGC GCTCAACCAA CTGGCCCTGC AGAGCAACCG CGAAATCCTG
TTCTCGCCCC GCCTGACCGT CGGCAGGCGA AGCCTCCCGC TCAAGGGCGC CTTCACCGCC
GAGGAGGCTC TGGCGCGCCT GCTGTCGGGC TCAGGCCTGA CCGTGCGGCT GGAGGGGCGC
AGCTTCCTCA TCCAGCAGGC GCCCGCCCCG CCCCCGCCGC AACGCCTGCT CTCACCGCCC
GAAACGGCGC CCGCCCAGGA CATGCCATTG GAATCGGTGG TGGTGACGGC CCTCAAGCGT
GACAGCGAAC TTCAGAAGAC GCCCGTCAGC ATGACCGTGC TGAGCGGCGA AAAGCTCTCG
CGTCTCGGCA TGGTCGATCT TGAGCAGGCC GCCCCCCTGC TGCCAGGCCT GAAACTAATA
TCGACGGCGT TTGGCCGCCG GCTGGTCCTG CGGGGCGTCT ACGGCGCCGG CGAGGCCACC
ACGGGCCTCT ACTACGACGA GACCCCGATG ACCGGCCCCG TGGGCACCAC GGCCGATCCG
GGCGTCATGT CCCCTGAGCT GGCCCTGGTC GACATCGACC GCATTGAGCT GCTGCGCGGC
CCGCAAGGCA CGTTGTACGG CTCGGGCTCC ATGGGCGGCG CTCTGCGGGT GCTGTTCCGG
CATCCAGATC TGAACGAGAC CGAAGGCTTC GTCAGCGCCA ACGCCACCAG CCTCGCCCAT
GGCGGTCAGG CCGGCGCGGT GACCGCCGCG TTCAACGCCG TGCTCGCGCC AGGCGTGCTG
GCCGTTCGCC TGACGGCCTA TGATCTCAAG ACGCCCGGCT TGATCGACAA TATCCGCCTG
GGCCTCTCCG ACGTGGATTC CTCGGAGGTG AAGGGCGCGC GCCTGGGCCT CCGATGGCAG
ATCAGCGATC GGCTTTCGCT GCTGCTGTCG GGCACGAGCC AAGACACCCA TCGCGACGAC
ATCTCGGCCT GGCACGACAT CGCCGGCCCC TGGCGCACGC GCCACGCCGC CCGGGCCCCG
TTCGACGGCG ACATCGACCT GGCCAGCGCC ACCCTGCATT GGGAGGGCGA CGCGGTGGAC
GTCATCGCCA CCGCGTCCCG CTATCGCTGG CGGCTGACCC GACGCTCGGA CTACAGCGGC
GTGTTGCTGG GCGAACGCGA CAGCGCATCG GGGTGCAAGC GCTATTACGT CCTGAGCGGA
ACGTGCGACG CAAGCCAGCA GCAAGGCTAC GCCGGCTATG TCGACAGCCT GTATCCGGCG
ATCCTCTACC AGCCCGCCAC GCTGACCTCG TCGATCCAGG AGATCCGCGC CTCTTCGATC
CGCGAGAGCC CGATCAGTTG GACGGTCGGC GTCTATAACG AGGTTCGCAA CGACCATATC
GACAGCCAGG TGCCGCACGT TGATCCGCTG ACAGGTGCGG TGCAGCAGCC CGTGATCCTG
ATTGGTCGCC GCAGCATCGA CAACCACCTC AGTCAGAGTG CTGTGTTCGG CGAGATCTCC
TACGATCCCG CCCCGGCCAC CCGTTTCACC CTGGGCGCCC GGCGGTTCGA CTACATCAAG
CGCGACAGCG GCGAGGTGCA GGTCCCCAAT GTCGTGTCGG GCACCTGGGC GGACTACATG
ATCGACGCGC GCACGCCAGA GCGGGGCTGG AGCTTCAAGG CCCTGGGCAG CCAGCAGATC
ACCCCGGACA TCCTGGGTTA TGCCCAGGTC TCGCAAGGCT TTCGCCCCGG CGGCGTCAAT
GTCGTTCCCG GCCTGGCCGA CAACTTGGCG CCTTATCGGT CCGATCATCT GACCAACTAC
GAAGTCGGGC TGAAGACTCA GTCCGCCGAC CGGCGCTTCA CGGCCAATTC CGCAGTGTAC
CAGATCGACT GGCGCGACAT GCAGTACAGC GCCCAGACGC AGAACCGCGC CTTCTCGTTC
CTGACCAATA TCGGCGCCAG CCGCATCCGT GGGGTCGAGA GCGAATTCAC AGCCCGCCGC
CTATGGGGTT GGGACGCCGC CGCCAGCGCC ACCTTCACCG ACGCGCGCCT GACCGCCGAC
CAGATCACCA ACACCGCCAT CGGCCTGGGT CGAAAGGGCG ACCGCCTGCC GGTCGTGCCC
AAGTTCGCGA CCGCCGCGTC GATTGAGCGG CAATGGCCGC TGACCGGCAC GCTCGAGGGC
CGCTTCCGGA TCGACGCGGC CTATACGGGA ACATCGCGCT CGGCGTTCAA CATGGGCAAC
GCCGACTATC TGAAGATGGG CGGCTACGCC ACGTTCGGCG TCAGCTTCGG CGTCGAGCAC
GCGAACTGGC GGGCCGACCT CGCGCTCGAC AACCTGCTGG ACCGAGCCGG CCGCGCCTCC
GCCCAGCGCA ACACCTCCGG GCCGATCGAC TATTACGGCA TCCCGCCGCG CACGCTGCGG
ATCACTCTGG AGCGCGGCTT CTAA
 
Protein sequence
MRLGAVAAML CATSCPIVWL GVPAQAEAAS VAAATRFDIP AQPLSKALNQ LALQSNREIL 
FSPRLTVGRR SLPLKGAFTA EEALARLLSG SGLTVRLEGR SFLIQQAPAP PPPQRLLSPP
ETAPAQDMPL ESVVVTALKR DSELQKTPVS MTVLSGEKLS RLGMVDLEQA APLLPGLKLI
STAFGRRLVL RGVYGAGEAT TGLYYDETPM TGPVGTTADP GVMSPELALV DIDRIELLRG
PQGTLYGSGS MGGALRVLFR HPDLNETEGF VSANATSLAH GGQAGAVTAA FNAVLAPGVL
AVRLTAYDLK TPGLIDNIRL GLSDVDSSEV KGARLGLRWQ ISDRLSLLLS GTSQDTHRDD
ISAWHDIAGP WRTRHAARAP FDGDIDLASA TLHWEGDAVD VIATASRYRW RLTRRSDYSG
VLLGERDSAS GCKRYYVLSG TCDASQQQGY AGYVDSLYPA ILYQPATLTS SIQEIRASSI
RESPISWTVG VYNEVRNDHI DSQVPHVDPL TGAVQQPVIL IGRRSIDNHL SQSAVFGEIS
YDPAPATRFT LGARRFDYIK RDSGEVQVPN VVSGTWADYM IDARTPERGW SFKALGSQQI
TPDILGYAQV SQGFRPGGVN VVPGLADNLA PYRSDHLTNY EVGLKTQSAD RRFTANSAVY
QIDWRDMQYS AQTQNRAFSF LTNIGASRIR GVESEFTARR LWGWDAAASA TFTDARLTAD
QITNTAIGLG RKGDRLPVVP KFATAASIER QWPLTGTLEG RFRIDAAYTG TSRSAFNMGN
ADYLKMGGYA TFGVSFGVEH ANWRADLALD NLLDRAGRAS AQRNTSGPID YYGIPPRTLR
ITLERGF