Gene Caul_0123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0123 
Symbol 
ID5897835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp134539 
End bp136551 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content65% 
IMG OID641560608 
ProductTonB-dependent receptor plug 
Protein accessionYP_001681759 
Protein GI167644096 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0182279 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA CACTTCTGGC CGGCGTTTCG GCTACCTTCG CCCTCTCCCT GGCGGCCACC 
GCCGCCCACG CCCAGTCGAT CGACTACGGC GCGATGGAGC AAATCTTCAA CGAACCGGTG
ACCACCAGCG CTACCGGCTC GCCCCAGCGC GCCACCGAAG TGCCGGTCGA TATGACCATC
ATCAGCGCGG CGGACATCCA TCGCTCGGGC GCCATCGACA TTCCGACCAT CCTCAGCCGC
GTCGCCGGCG TCGACATACT TCCCTATGCG GCCGGCTACT CGGACGTGGG CGTCCGCGGC
TACGATCAGG CCCGCTCGCC ACGCCTACTG GTGCTGATCA ACGGCCGCCA GGTCTATTTC
GACCACCAAG GCTACACCGA CTGGACCTCG CTGCCCGTCG AGCTGAGTGA AATCCGCCAG
ATCGAAGTCG TCCGCGGCCC CAACAGCGCC CTGTTCGGCT TCAACGCCGT CTCCGGCGTC
GTCAACATCA TTACCTACAA CCCCAAGCTC GACAACGTGG GCGCGGTCAC CGCCTATGCT
GGCACCCACG GCGCAGCAGG CGTTTCGGCC GTGCAGACCA TGCACATCGG CGACCGCTTC
TTCATGCGGC TCTCGGCCGG CGGTCGGAAG CAGGATCAGT GGAAGAGCAC TACGCCGGCG
GTTCCGGCGT CTCTCATCCA AGACCCGCAA TCGACCCGGG CCAACGTCGA CGCCATCGTG
CAACTGGCGC CCAAGACCGA ACTGCGCCTG GAGGGGTCCT ATTCCAATCT GGCCGGGACC
CAGGTGAACT CCATCTACTC GTTCGGCCAC CAGCGGAACC AGACCCACTC CGTCAAGGCC
ACGCTATCGT CCGACACGGG CTTTGGCCTG CTGCAGGCTC AGGTCTATCA GAACCACATG
GACGCGCTGA CGCCCGGTCG CCGTTACGTG AACCGGATCA CCGTGGGTAG CGTTCAGGAC
CTGTTCAAGG TCGGCACGAA CCAGACCTTC CGCGTCGGCC TCGAATATCG CTCCAACGAG
ATGGACACCT CGCCCTTCGC CGGCGGCGTG GTCAGCTACA CGGTGCTGGC GCCTTCGGCC
ATGTGGAACT GGGCGATCAA CGACAAGGTC TCGCTCACCG CCGCGGCCCG CTATGACCAG
CTGAAGCTGA AGCGGACCGG CCTTCTCCCC CCCGGCTATG CGTTCACCAA CGCCGACTGG
GACAGAACGA TCAGCGAGCC CAGCGTCAAT CTAGGCGCGG TCTACAAGGT CACGCCGGCC
GACAGCCTGC GTCTGACCTA CGCCCGCGGG GTCCAGGCTC CCACCCTGTT CGAACTGGGC
GGACTGGTGT TCGCGCCGAC GGGCGCCCTC GGCTCGGGGA CGATCAACTC TGGCAACCCG
AATATCTCGC CGTCGATCGT GGCCAATTAT GAGCTGGCCT ACGACCACGA CTTCGCGCCC
CTGCGGGCTC GCGCCGGCGT AAAGGCCTTC TCGCAAACCA CGAAGGACGT GAAGGGCAGC
TCCTACCCGA CGATCATCGA CGTTCGCCCC ACCCCCACGA CCTACGGGGC CAACCTCTAC
CAGAACGTCG GCGACTCGAA GATGGTGGGC TTCGAGGCCT CGGCCTCGGG CAAGTTCGGA
AAGGGCTTCA ACTGGAGCGC CGACACAACC TATACCGACG TCAAGAACCA GCCTTCGGTC
ACCGACGCCG CGCTTCTGCT GCGCCGGAGC ACCTTCGCCC AGTTCACGCC GAAATACCGC
AGCAACCTGG CCCTCGGTTG GACGGGCGGG AAGTGGACCA CGGACGGCTA TCTGCACCAC
ATCAGCAAGG CGATGACCCT CAACGGCGCC GCGCTGGAGC CGGTGGCCGC CTACACCACC
CTTGCCAGCC GGGTCGCCTA TATGGCGCCC ATGGGCATCG AGCTCGCCGT TAGTGGCCAG
AACCTGCTGC ACGACCGTCA AGCGCAGGCG AAGGGGGCGA CCGGCCTGAT GGCCGAGCGT
CAGGTGATGT TCACCGTCAG CAAGGCCTGG TGA
 
Protein sequence
MKKTLLAGVS ATFALSLAAT AAHAQSIDYG AMEQIFNEPV TTSATGSPQR ATEVPVDMTI 
ISAADIHRSG AIDIPTILSR VAGVDILPYA AGYSDVGVRG YDQARSPRLL VLINGRQVYF
DHQGYTDWTS LPVELSEIRQ IEVVRGPNSA LFGFNAVSGV VNIITYNPKL DNVGAVTAYA
GTHGAAGVSA VQTMHIGDRF FMRLSAGGRK QDQWKSTTPA VPASLIQDPQ STRANVDAIV
QLAPKTELRL EGSYSNLAGT QVNSIYSFGH QRNQTHSVKA TLSSDTGFGL LQAQVYQNHM
DALTPGRRYV NRITVGSVQD LFKVGTNQTF RVGLEYRSNE MDTSPFAGGV VSYTVLAPSA
MWNWAINDKV SLTAAARYDQ LKLKRTGLLP PGYAFTNADW DRTISEPSVN LGAVYKVTPA
DSLRLTYARG VQAPTLFELG GLVFAPTGAL GSGTINSGNP NISPSIVANY ELAYDHDFAP
LRARAGVKAF SQTTKDVKGS SYPTIIDVRP TPTTYGANLY QNVGDSKMVG FEASASGKFG
KGFNWSADTT YTDVKNQPSV TDAALLLRRS TFAQFTPKYR SNLALGWTGG KWTTDGYLHH
ISKAMTLNGA ALEPVAAYTT LASRVAYMAP MGIELAVSGQ NLLHDRQAQA KGATGLMAER
QVMFTVSKAW