Gene Information |
Locus tag | Caul_2548 |
Symbol | |
ID | 5900003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2766256 |
End bp | 2769261 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641563039 |
Product | TonB-dependent receptor |
Protein accession | YP_001684173 |
Protein GI | 167646510 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | |
|
|
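The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop.

    Assumption: the gene length includes the stop codon, so one codon
    is subtracted from the total.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the Gene Information table.
length = gene_length(2766256, 2769261)
print(length, protein_length(length))
```

With the listed values, both results agree with the Gene Length (3006 bp) and Protein Length (1001 aa) fields.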
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACATCA AATCCGTACG GGAGCGCCTT CTGGCCTCCA CCATGATCTG CGGCGCCGCC CTAGCGACGC TCGCGGCCAG CCCCGTCTCG GCGCAGACCA CACCGGCCCC GGCCGCTGAT GAAGTCGAAG AAATCGTCGT CACCGGCTCG CTGTTCCGCC GCACCGACAC CGAAACGCCC TCGCCCGTGA CCGTGCTGAC CTCGGAAAAC CTTCAGCGCG CCGGCATCTC GACCGCCTCG GACGCCATCC GTTCGATCTC GGCCGACGGC GCCGGCTCGA TCGGCACCGG CTTCCAGAGC GGCTTCAGCG CCGGCGGCTC GGCCGTCTCG CTGCGCGGCC TGGGCGTCTC CTCGACCCTC GTGCTGGTCG ACGGCCTGCG TTCGGCCAAC TTCCCGATCA ACGACGACGG CCACAACGCC TATGTCGATC TGAACTCCAT TCCGTTCAGC CTGATCGACA GCATTGAAGT CCTGAAGGAC GGCGCTTCGT CGTCGTACGG CGCCGATGCC ATCGGCGGCG TGGTGAACCT GAAGCTCAAG AAGCAGTTCG TCGGCGTCAA GGGCAGCGCC GAAGTCGGTC AGAGCGACCG CAGCGACGCC GAGCACCGGC GCGCCGACGT CACCCTGGGC TACGGCGACT ACGCAGAGAC CGGCTGGAAC TTCTACGTCA ACGCCGAGTA CCAGAAGGAC GACCGGGTCA CGAGCCACAG CCGCGGCTTC CCGTTCAACA CCCAGGACCT GCGCTCGATC GGCGGCCTGG ACCTGAACAC CGCCGACAGC TCGCTGACCA CGGCGACCCC GAACGCGGTC GTTGTGCGGA CCACCCAAAC CGATCTTAAC AACCCGCTCG CCGGCGGCGC TTCGTTGATC CCGAGCGGAA CCTATGTCGA CGGCGACGGC AAGACTCAGA ATTACTCCAA CTACACCACG CTGAACACGA ACTGCGCCAA CGGCCCCTAC ACCTCCACCA GCCTCAGCGC TCGCGGCAGC GGCTGTAAGT GGGACTTGGT CGACACCTAT CGCCAGATCC AACCGCTGCA GAAGCGCTAC GCTTTCAATG GCCGCCTGAG CATGCGCCTC AACGAGAACA TCGAGGCCTA CGCCACCGGC AGCTATTCCA ACAGCTACGT CAGCATCAAG GGCGCGCCAA CGGCGGTCCG CGCCACTCAG CCCTTCGGCG GCGCGCCCTC GCTGGCGTCC AGCAACCCGG GCATCGTGCT TCCGGTCTAT GTCTGTACGT CGGGCATCAA CTGCGCCACC GCCGGCGCGC CCGGTCAGCG TCTGAACCCG AACAACCCCT ACGCGGCCGC CTTCGCTAAC GATCCGGCGA ACGGTGCGGC CCGCCTCTAC TACTTGTTCG GCGACATCCC CGCCGGCAGC GAGCGCTCGA ACGAAGTCAT CCGGGGCACG TTCGGTCTAA AGGGCAGCTT CGGCGACGAC TGGAACTGGA GCGTGGACGC GGCCGGCGCT CGTGACAACC TGAAGATCAC GCAGCATGGC CTGCTGAACA TCGCCAACTT GATGAACTCG ATCAATACGG GTTCCTACAA CTTCGTCGAC CCGTCGAAGA ACACCCAGGC GGTTCGCGAC TTCATCGCGC CGGACAAGAC CACCCCGTCG CACTCGTCGA TGATGTCGTT GGACGGCTTG ATCACCAAGT CGCTCTGGAC CCTGCCGGGC GGTGACCTGC AGGTCGGCGT CGGCGCCCAG ATCCGCAAGG AAGTGCTGGT CAACAACAAC CAGAACGTCC GTCTGGACAC CTACGGCCTG ACGACGGCCT CGGCGTTCGG CAAACACACG 
GTCAAGGCCG CGTTCTTCGA AGTCAACGCC CCGGTCCTTG AACAGCTTGA GCTGAACGTC TCGGGCCGTT ACGATGACTA TTCGGAAGGC TTCAGCCACT TCTCGCCGAA GTTTGGCGTC AAGTACACGC CGATCAAGCA ACTGGCGTTC CGTGGCACCT TCTCGAAGGG CTTCCGCGCC CCGACCTTCG CCGAGTCCGG CCCGCGTTCG CAATACGCCG GCTTCGTGAG CACCACGCCG CCGGCCGCCT TCGTGAACGC CCACGGCACC TCGAGCGCCA ACAATCCGTA TGCCCAGCAA TACAGCCTGG GCCGCGGCGT GGCCGGCAAC CCGAACCTGA AGCCGGAAAC CTCGCGCAGC TTCACCATCG GCGCCATCGC CGAGCCGACC AGCTGGCTCA GCCTTACGGT CGACTACTAC AACGTGAAGA AGTCTGACCT GATCACCTCG GGTCCCGATA TCAGCAAGGC GGTTGCGGCC TACTACGGCC AGACCACCCA GGCGGCCGGC TGCGCCGCTA TCGCGGCAGG TTATCCGGGG TACTCGTGCA ATGTGGTGGA CGCCGTCGAC CCGTTGTATC CGACCGCTCA GCCGCGCGTG CTGATCATCA ACGTCCCGTA TGTGAACGCA AACTACGCGA TCACTTCGGG CGTGGATTTT GCGGCCACCG CCAAGGTCCC GGTCACGGAC AACATCAAGT GGACCAGCCG CGTTGAAGTC ACTCACCTGC TGAAGTACGA CCTGCACACC TCCACGGAAG TGCAGAAATA CGCCGGCACC CTGGGTCCGT ACGATCTGTC GTCGGGCAAC GGTACGCCGG ACTGGAAGGG CAACTGGCAG AACACGGTGG ACTTCGGTCG CTACACCGTG TCGGCAACGG CCTACTATGT GGGTTCGATC AAATCGGTCG CCGCCGACAC CAACGGCAGC ACCGACTGCC CGAAGGGCAA CCCCTACGGT GGCGCGGCCA ACCCGGCTGC CGCCAACAAG TTCTGTAAGA TCAAGAGCTT CGTTAATGTC GATCTGAACG GCACGATGCA GTTGAACGAC GGCGTCCAGC TGTACGGCAA CGTCGGCAAC CTGTTCGACG AACGGGCGCC GATCGCGCCG GGCGCTTACG CCAGCGCGCC GAACTTCCTG ACCACCTTCC ACTATGCCGG CCTGATCGGC CGGACGTTCA AGGTGGGTGT CCGCTTCCAG TACTAA
|
Protein sequence | MNIKSVRERL LASTMICGAA LATLAASPVS AQTTPAPAAD EVEEIVVTGS LFRRTDTETP SPVTVLTSEN LQRAGISTAS DAIRSISADG AGSIGTGFQS GFSAGGSAVS LRGLGVSSTL VLVDGLRSAN FPINDDGHNA YVDLNSIPFS LIDSIEVLKD GASSSYGADA IGGVVNLKLK KQFVGVKGSA EVGQSDRSDA EHRRADVTLG YGDYAETGWN FYVNAEYQKD DRVTSHSRGF PFNTQDLRSI GGLDLNTADS SLTTATPNAV VVRTTQTDLN NPLAGGASLI PSGTYVDGDG KTQNYSNYTT LNTNCANGPY TSTSLSARGS GCKWDLVDTY RQIQPLQKRY AFNGRLSMRL NENIEAYATG SYSNSYVSIK GAPTAVRATQ PFGGAPSLAS SNPGIVLPVY VCTSGINCAT AGAPGQRLNP NNPYAAAFAN DPANGAARLY YLFGDIPAGS ERSNEVIRGT FGLKGSFGDD WNWSVDAAGA RDNLKITQHG LLNIANLMNS INTGSYNFVD PSKNTQAVRD FIAPDKTTPS HSSMMSLDGL ITKSLWTLPG GDLQVGVGAQ IRKEVLVNNN QNVRLDTYGL TTASAFGKHT VKAAFFEVNA PVLEQLELNV SGRYDDYSEG FSHFSPKFGV KYTPIKQLAF RGTFSKGFRA PTFAESGPRS QYAGFVSTTP PAAFVNAHGT SSANNPYAQQ YSLGRGVAGN PNLKPETSRS FTIGAIAEPT SWLSLTVDYY NVKKSDLITS GPDISKAVAA YYGQTTQAAG CAAIAAGYPG YSCNVVDAVD PLYPTAQPRV LIINVPYVNA NYAITSGVDF AATAKVPVTD NIKWTSRVEV THLLKYDLHT STEVQKYAGT LGPYDLSSGN GTPDWKGNWQ NTVDFGRYTV SATAYYVGSI KSVAADTNGS TDCPKGNPYG GAANPAAANK FCKIKSFVNV DLNGTMQLND GVQLYGNVGN LFDERAPIAP GAYASAPNFL TTFHYAGLIG RTFKVGVRFQ Y
|
| |
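The Gene sequence field begins with TTG while the Protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of the permitted start codons and an initiator codon is read as methionine. The following is a minimal sketch of that translation in plain Python. It assumes the sequence is supplied in the coding direction, as it is listed here (it begins with TTG and ends with TAA); the names `CODON_TABLE`, `START_CODONS_TABLE_11` and `translate_cds` are hypothetical helpers, not part of any library.

```python
BASES = "TCAG"
# Amino acids in TCAG order (first base varies slowest). Table 11 uses the
# same assignments as the standard code and differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(grouped_dna: str) -> str:
    """Translate a CDS given in the coding direction.

    Whitespace is removed first, because the record displays the
    sequence in space-separated groups of ten bases.
    """
    dna = "".join(grouped_dna.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 long")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS_TABLE_11:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the initiator codon is read as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the Gene sequence field.
print(translate_cds("TTGAACATCA AATCCGTACG GGAGCGCCTT"))  # MNIKSVRERL
```

The output matches the first ten residues of the Protein sequence field. Passing the full Gene sequence in the same way should reproduce the whole protein, provided the base groups are copied without loss.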