Gene Caul_2548 details


Gene Information

Locus tag: Caul_2548
Symbol:
ID: 5900003
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2766256
End bp: 2769261
Gene Length: 3006 bp
Protein Length: 1001 aa
Translation table: 11
GC content: 64%
IMG OID: 641563039
Product: TonB-dependent receptor
Protein accession: YP_001684173
Protein GI: 167646510
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4771] Outer membrane receptor for ferrienterochelin and colicins
TIGRFAM ID:
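
The Start bp, End bp, Gene Length, and Protein Length fields above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon (neither is stated in the record). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2766256, 2769261)
print(length)                  # 3006
print(protein_length(length))  # 1001
```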


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAACATCA AATCCGTACG GGAGCGCCTT CTGGCCTCCA CCATGATCTG CGGCGCCGCC 
CTAGCGACGC TCGCGGCCAG CCCCGTCTCG GCGCAGACCA CACCGGCCCC GGCCGCTGAT
GAAGTCGAAG AAATCGTCGT CACCGGCTCG CTGTTCCGCC GCACCGACAC CGAAACGCCC
TCGCCCGTGA CCGTGCTGAC CTCGGAAAAC CTTCAGCGCG CCGGCATCTC GACCGCCTCG
GACGCCATCC GTTCGATCTC GGCCGACGGC GCCGGCTCGA TCGGCACCGG CTTCCAGAGC
GGCTTCAGCG CCGGCGGCTC GGCCGTCTCG CTGCGCGGCC TGGGCGTCTC CTCGACCCTC
GTGCTGGTCG ACGGCCTGCG TTCGGCCAAC TTCCCGATCA ACGACGACGG CCACAACGCC
TATGTCGATC TGAACTCCAT TCCGTTCAGC CTGATCGACA GCATTGAAGT CCTGAAGGAC
GGCGCTTCGT CGTCGTACGG CGCCGATGCC ATCGGCGGCG TGGTGAACCT GAAGCTCAAG
AAGCAGTTCG TCGGCGTCAA GGGCAGCGCC GAAGTCGGTC AGAGCGACCG CAGCGACGCC
GAGCACCGGC GCGCCGACGT CACCCTGGGC TACGGCGACT ACGCAGAGAC CGGCTGGAAC
TTCTACGTCA ACGCCGAGTA CCAGAAGGAC GACCGGGTCA CGAGCCACAG CCGCGGCTTC
CCGTTCAACA CCCAGGACCT GCGCTCGATC GGCGGCCTGG ACCTGAACAC CGCCGACAGC
TCGCTGACCA CGGCGACCCC GAACGCGGTC GTTGTGCGGA CCACCCAAAC CGATCTTAAC
AACCCGCTCG CCGGCGGCGC TTCGTTGATC CCGAGCGGAA CCTATGTCGA CGGCGACGGC
AAGACTCAGA ATTACTCCAA CTACACCACG CTGAACACGA ACTGCGCCAA CGGCCCCTAC
ACCTCCACCA GCCTCAGCGC TCGCGGCAGC GGCTGTAAGT GGGACTTGGT CGACACCTAT
CGCCAGATCC AACCGCTGCA GAAGCGCTAC GCTTTCAATG GCCGCCTGAG CATGCGCCTC
AACGAGAACA TCGAGGCCTA CGCCACCGGC AGCTATTCCA ACAGCTACGT CAGCATCAAG
GGCGCGCCAA CGGCGGTCCG CGCCACTCAG CCCTTCGGCG GCGCGCCCTC GCTGGCGTCC
AGCAACCCGG GCATCGTGCT TCCGGTCTAT GTCTGTACGT CGGGCATCAA CTGCGCCACC
GCCGGCGCGC CCGGTCAGCG TCTGAACCCG AACAACCCCT ACGCGGCCGC CTTCGCTAAC
GATCCGGCGA ACGGTGCGGC CCGCCTCTAC TACTTGTTCG GCGACATCCC CGCCGGCAGC
GAGCGCTCGA ACGAAGTCAT CCGGGGCACG TTCGGTCTAA AGGGCAGCTT CGGCGACGAC
TGGAACTGGA GCGTGGACGC GGCCGGCGCT CGTGACAACC TGAAGATCAC GCAGCATGGC
CTGCTGAACA TCGCCAACTT GATGAACTCG ATCAATACGG GTTCCTACAA CTTCGTCGAC
CCGTCGAAGA ACACCCAGGC GGTTCGCGAC TTCATCGCGC CGGACAAGAC CACCCCGTCG
CACTCGTCGA TGATGTCGTT GGACGGCTTG ATCACCAAGT CGCTCTGGAC CCTGCCGGGC
GGTGACCTGC AGGTCGGCGT CGGCGCCCAG ATCCGCAAGG AAGTGCTGGT CAACAACAAC
CAGAACGTCC GTCTGGACAC CTACGGCCTG ACGACGGCCT CGGCGTTCGG CAAACACACG
GTCAAGGCCG CGTTCTTCGA AGTCAACGCC CCGGTCCTTG AACAGCTTGA GCTGAACGTC
TCGGGCCGTT ACGATGACTA TTCGGAAGGC TTCAGCCACT TCTCGCCGAA GTTTGGCGTC
AAGTACACGC CGATCAAGCA ACTGGCGTTC CGTGGCACCT TCTCGAAGGG CTTCCGCGCC
CCGACCTTCG CCGAGTCCGG CCCGCGTTCG CAATACGCCG GCTTCGTGAG CACCACGCCG
CCGGCCGCCT TCGTGAACGC CCACGGCACC TCGAGCGCCA ACAATCCGTA TGCCCAGCAA
TACAGCCTGG GCCGCGGCGT GGCCGGCAAC CCGAACCTGA AGCCGGAAAC CTCGCGCAGC
TTCACCATCG GCGCCATCGC CGAGCCGACC AGCTGGCTCA GCCTTACGGT CGACTACTAC
AACGTGAAGA AGTCTGACCT GATCACCTCG GGTCCCGATA TCAGCAAGGC GGTTGCGGCC
TACTACGGCC AGACCACCCA GGCGGCCGGC TGCGCCGCTA TCGCGGCAGG TTATCCGGGG
TACTCGTGCA ATGTGGTGGA CGCCGTCGAC CCGTTGTATC CGACCGCTCA GCCGCGCGTG
CTGATCATCA ACGTCCCGTA TGTGAACGCA AACTACGCGA TCACTTCGGG CGTGGATTTT
GCGGCCACCG CCAAGGTCCC GGTCACGGAC AACATCAAGT GGACCAGCCG CGTTGAAGTC
ACTCACCTGC TGAAGTACGA CCTGCACACC TCCACGGAAG TGCAGAAATA CGCCGGCACC
CTGGGTCCGT ACGATCTGTC GTCGGGCAAC GGTACGCCGG ACTGGAAGGG CAACTGGCAG
AACACGGTGG ACTTCGGTCG CTACACCGTG TCGGCAACGG CCTACTATGT GGGTTCGATC
AAATCGGTCG CCGCCGACAC CAACGGCAGC ACCGACTGCC CGAAGGGCAA CCCCTACGGT
GGCGCGGCCA ACCCGGCTGC CGCCAACAAG TTCTGTAAGA TCAAGAGCTT CGTTAATGTC
GATCTGAACG GCACGATGCA GTTGAACGAC GGCGTCCAGC TGTACGGCAA CGTCGGCAAC
CTGTTCGACG AACGGGCGCC GATCGCGCCG GGCGCTTACG CCAGCGCGCC GAACTTCCTG
ACCACCTTCC ACTATGCCGG CCTGATCGGC CGGACGTTCA AGGTGGGTGT CCGCTTCCAG
TACTAA
 
Protein sequence
MNIKSVRERL LASTMICGAA LATLAASPVS AQTTPAPAAD EVEEIVVTGS LFRRTDTETP 
SPVTVLTSEN LQRAGISTAS DAIRSISADG AGSIGTGFQS GFSAGGSAVS LRGLGVSSTL
VLVDGLRSAN FPINDDGHNA YVDLNSIPFS LIDSIEVLKD GASSSYGADA IGGVVNLKLK
KQFVGVKGSA EVGQSDRSDA EHRRADVTLG YGDYAETGWN FYVNAEYQKD DRVTSHSRGF
PFNTQDLRSI GGLDLNTADS SLTTATPNAV VVRTTQTDLN NPLAGGASLI PSGTYVDGDG
KTQNYSNYTT LNTNCANGPY TSTSLSARGS GCKWDLVDTY RQIQPLQKRY AFNGRLSMRL
NENIEAYATG SYSNSYVSIK GAPTAVRATQ PFGGAPSLAS SNPGIVLPVY VCTSGINCAT
AGAPGQRLNP NNPYAAAFAN DPANGAARLY YLFGDIPAGS ERSNEVIRGT FGLKGSFGDD
WNWSVDAAGA RDNLKITQHG LLNIANLMNS INTGSYNFVD PSKNTQAVRD FIAPDKTTPS
HSSMMSLDGL ITKSLWTLPG GDLQVGVGAQ IRKEVLVNNN QNVRLDTYGL TTASAFGKHT
VKAAFFEVNA PVLEQLELNV SGRYDDYSEG FSHFSPKFGV KYTPIKQLAF RGTFSKGFRA
PTFAESGPRS QYAGFVSTTP PAAFVNAHGT SSANNPYAQQ YSLGRGVAGN PNLKPETSRS
FTIGAIAEPT SWLSLTVDYY NVKKSDLITS GPDISKAVAA YYGQTTQAAG CAAIAAGYPG
YSCNVVDAVD PLYPTAQPRV LIINVPYVNA NYAITSGVDF AATAKVPVTD NIKWTSRVEV
THLLKYDLHT STEVQKYAGT LGPYDLSSGN GTPDWKGNWQ NTVDFGRYTV SATAYYVGSI
KSVAADTNGS TDCPKGNPYG GAANPAAANK FCKIKSFVNV DLNGTMQLND GVQLYGNVGN
LFDERAPIAP GAYASAPNFL TTFHYAGLIG RTFKVGVRFQ Y
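
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted alternative start codons and is read as methionine at initiation. The sketch below checks the first 60 bases of the gene against the first 20 residues of the protein. It is an illustration, not the database's own pipeline, and the names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third base fastest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a table 11 start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is always read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_60 = "TTGAACATCA AATCCGTACG GGAGCGCCTT CTGGCCTCCA CCATGATCTG CGGCGCCGCC"
print(translate(first_60))  # MNIKSVRERLLASTMICGAA
```

Passing the full gene sequence to the same function should give the protein sequence above if the record is internally consistent. The final codons, TAC TAA, translate to Y followed by a stop.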