Gene Caul_2677 details


Gene Information

Locus tag: Caul_2677
Symbol:
ID: 5900132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2909504
End bp: 2911783
Gene Length: 2280 bp
Protein Length: 759 aa
Translation table: 11
GC content: 65%
IMG OID: 641563168
Product: TonB-dependent receptor plug
Protein accession: YP_001684302
Protein GI: 167646639
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
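
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2909504, 2911783)
print(length_bp, protein_length(length_bp))  # prints "2280 759"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.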


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.126717
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGAAGC TTCAAGTGCT CGCGGGCGTC GCCGGCGCGA CGATGATCAT GGCGGCCAGT 
CAGGCGGCGG CGCAGGAGGC CACCTCGCTC GAGGAGATCG TCGTCACCGC CCAGAAGCGG
TCGGAGAACC TGCAGGACGT GCCGGTCTCG GTGACCGCCT TCACCGCCGA CCAGCTCAAG
AACCAACGGG TCGGCGACGT CCTGGCGCTG TCGGGCCTGT CGCCCGGCCT GCAGATCAAG
ACCGACGACA ACGCCGCCAA TCCTCGCATC TTCATCCGCG GCATCGGGGT CAACGATTTC
AATCCGGCCA CGGCCAGCGC CGTCGGCGTC TATGTCGATG GGGTCTATGT GGCCTCACCC
CTGGCCCAGA TGGCCGGGTT CTATGATCTG CAGCAGGTCG AGGTGCTGCG GGGACCGCAG
GGCACGCTGT ACGGCCGCAA CACCACCGGC GGGGCGATCA ACGTCACCAC CAAGAAGCCG
AGCAGCACGC CCGAGGGCGA TCTGGCGGTC GACTACGGCC GGTTCAATTC GCTCAAGGTT
CAGGGCGGCT TTGGCGGTCC GGTCGGCGGC GACACCCTGT CGTTCCGGAT CGCCGGCCTC
TACGACAAGA GCGACGGCTA TACGCGAAAC CGCCTGACGG GTCACAAGGG CAATGACGCC
GACCGCAAGG CCGTGCGCGG CGCCCTGCGC TTCAAGCCCG ACGACAAGCT GACCGTCGAC
GTGTCGGCCA GCTACAGCAA GTCGAGCGGC GGCTCGATCC TGACCTACAA CCGCTCGCTG
GTGGCCCAGA CGGCCGAGGC GGCCAGTACC GCCCTGCCCG ACCCCACCTT CGGCTATGCG
TTCTGCAAGC CGGAGTATTA CACCTCCGGC CAGTGCACCA ACGTCGCCGG CTACGCCAAC
ACCAGCAGCA ACAAGTACGA GGGCGACTAC CGCTTCGAGG GCAAGGACGT CGTCAAGCTG
TTCGGGGCCA CGACCTCGAT CAGCTACGAT TTCGGCGGCG TGACGCTCTA CTCGGTCACC
GGCTATCAGC GCGCCAAGCG CGACGACCAT GAGGAGACCG ACGCCAACCC GGTCTCGATC
TTCGACGCCC GCTATATCGC CAAGCAGGAC ACCACCAGCC AGGAATTGCG CCTGCAATCG
AACGGCGCGA CCGCCCTGCG CTGGGTCGCC GGCGTCTACG CCGCGCGCGA CAACCTGGAC
AATGACAGCC ACTACAATGT GCTGGAGGTG GCCCGCGTCC CCGATCCCGT CAACAATCCG
ACCGGCATGG ATCCGGCCAA CAGCGTCGGT GTGTTCGGCT GGCCCCTGCA CCAGAAGAGC
ACCAGCTACG CCGCCTTCGG CCAGGTCGAC TACGACTTGA CCCCCAAGCT GACCCTGACC
GGCGGCCTGC GCTGGTCGCA GGACAAGAAG ACCTTTCACT ATGTCAGCGA CGTGGACTAC
GGCCTGCTGA CCCTGTTCGA ATACGACAAC GCCAAGACCT TCAGCTCGAT CTCGGGACGC
CTGGGCCTGC GTTACGCGCT GAGCGACGAC GCCAATGTCT ATGCGACCTA CAATCGCGGC
ACGAAGAGCG GCGGCTTCTT CAGCGGCCAG ACGACCGATC CCCGGGATCT GGGTCCCTAC
AAGGATGAGA CGGTCAACGC CTATGAGGTG GGAGCCAAGA GCGAGTTCCT AGACCGCCGC
CTGCGGATCA ATGTCTCGGC CTTCTACTAT GATTACAAGG ACCTGCAGGT CTATACCCAG
GTGCAGCGAG ACGGCCTGCC CGTGCAGTTG TTCACCAACG CGTCCTCGGC TCGGGTCTAT
GGCGGCGAGG CCGAGATCGA GGCCCGGCCC ATCTCGGGCC TGAGCCTGAC CTTGGGCGCA
TCCCTGCTGA GCGCCGAATA CAAGGACTTC ATTTCGTACG TGGATCCTTC CTCGGCGCCC
ATCGACTATT CGGGCAACGC CCTGCCCTCC GCGCCGAAGA CCAGCCTGAA CGGCGCGGCG
CGCTATGAGC ATCCATTGGG CGCGGGCGAT CTGGTGACCC AGCTGGACTT CACCTATCGC
GGCAAGGTCT ATTACGACAC GGCCAACACC GAGCGGCTCA GCGACAAGGC GCGCGCCTAC
GTCAACGGCC AGGTCGGTTG GGCCTTCGCC GACGGGCGCT ACGAACTGGG CGTGTGGGGC
AAGAACCTGG CCGACACCAC CAACATCTCC GACATCACGC CGATCGCCGC CTTCGGCTTC
GACGTGTTCA GCATGGGTCC GCCGCGCACC TATGGCGTCT ATTTCAGGGC CAAGTATTGA
 
Protein sequence
MRKLQVLAGV AGATMIMAAS QAAAQEATSL EEIVVTAQKR SENLQDVPVS VTAFTADQLK 
NQRVGDVLAL SGLSPGLQIK TDDNAANPRI FIRGIGVNDF NPATASAVGV YVDGVYVASP
LAQMAGFYDL QQVEVLRGPQ GTLYGRNTTG GAINVTTKKP SSTPEGDLAV DYGRFNSLKV
QGGFGGPVGG DTLSFRIAGL YDKSDGYTRN RLTGHKGNDA DRKAVRGALR FKPDDKLTVD
VSASYSKSSG GSILTYNRSL VAQTAEAAST ALPDPTFGYA FCKPEYYTSG QCTNVAGYAN
TSSNKYEGDY RFEGKDVVKL FGATTSISYD FGGVTLYSVT GYQRAKRDDH EETDANPVSI
FDARYIAKQD TTSQELRLQS NGATALRWVA GVYAARDNLD NDSHYNVLEV ARVPDPVNNP
TGMDPANSVG VFGWPLHQKS TSYAAFGQVD YDLTPKLTLT GGLRWSQDKK TFHYVSDVDY
GLLTLFEYDN AKTFSSISGR LGLRYALSDD ANVYATYNRG TKSGGFFSGQ TTDPRDLGPY
KDETVNAYEV GAKSEFLDRR LRINVSAFYY DYKDLQVYTQ VQRDGLPVQL FTNASSARVY
GGEAEIEARP ISGLSLTLGA SLLSAEYKDF ISYVDPSSAP IDYSGNALPS APKTSLNGAA
RYEHPLGAGD LVTQLDFTYR GKVYYDTANT ERLSDKARAY VNGQVGWAFA DGRYELGVWG
KNLADTTNIS DITPIAAFGF DVFSMGPPRT YGVYFRAKY
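
The gene and protein sequences above can be cross-checked against the Translation table and GC content fields with a short sketch such as the one below. It uses only the Python standard library. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is written out from the NCBI amino acid string for translation table 11 rather than taken from any database API. Alternative start codons are not handled, which is enough here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11 in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence listed above.
print(translate("ATGCGGAAGC TTCAAGTGCT CGCGGGCGTC"))  # prints "MRKLQVLAGV"
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows the result to be compared with the protein sequence and the GC content field of this record.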