Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2073 |
Symbol | |
ID | 5899528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2217229 |
End bp | 2220231 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562562 |
Product | TonB-dependent receptor |
Protein accession | YP_001683699 |
Protein GI | 167646036 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.168139 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.78272 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCTT TAGGTCGGAC GCGACGGGGA TCGCCCTTTT TGAACGTGTT GTTCGCGACG AGCGCGGTCG CGATGATCGT ATCGGCTGGA TCGGCCGTCG CGCAGACGGC GCCGCCGCCC GCGGAGCCGG ACGCCACGGC CTTGACCGAG ATCGTGGTCA CGGGGACCCG CATCCGGTCC ACCGGCTTCA CCGCGCCCAC GCCGACTCAG GTCCTGGGCC AGGCCGACCT GGAGCGCAAC GCCGAGCCGA ACGTCTTCAC GACCATCGCC CAGCTGCCTT CCTTGCAAGG GTCAACGGGG GCGACCACGG GCACGTTCAG CACGTCGAGC GGCCAGCAGG GCCTGAGCTC GTTCTCGCTG CGAGGCCTGG GCACGATCCG GACGCTGACC CTGCTGGACG GCCAACGGGT GGTTCCGGCC AACATCACGG GTGTTCCTGA CATCAGCCTG TTCCCGCAAT TGCTGGTCGA GCGGGTCGAT GTCGTGACTG GCGGCGCATC CGCCTCCTAC GGCTCCGACG CGGTCGGCGG CGTCGTCAAC TTCATCACCA ACAAGCGCTT CGAAGGCTTC AAGGCCAATG TGTCGGCCGG GGTCACGACC TATGGCGACA ACGAGCAGTT CCTGCTGCAG GTCGCGGGCG GCAAGGCCTT CATGAACGAT CGCCTGCATG TCCAGGTCAG CGGCGAATAC GACGAGGAGC AGGGCGTTCC CGCCGGCGGC TTCGGTGAGG ACGCGCCTGG CAGCCGTGAC TGGTACACGA CCGCCACCCT GGTCAATCGC GGCGTCACCA ACGACGGCTC GCCGCAATAC CTCTACCGCG AGCACGCCCA AGCTTATCAG TACACCAAGT ACGGCCTGAT CAGCGCTGGT CCGCTGCAAG GCACCGCCTT CGACCAGGCC GGCAACCCCT TCCAGTTCCA GTACGGCTCC AATGGCGTGC CGTCCAAGGC CGCCAACGGC GCGGTGATCG GTTGCTACTC GAACGGCGGC TTCTGCGTCG GCGGCGACCT GTCGGGCAAT GTCGGCGTCG GCACGTCGCT GCAATCCGAA ATCACGCGGA TGAACAGCTA CGGCCGCGTC GCCTACGACA TCGACGACAA CAACGAGATC TACGGCACGC TGAACATCGC CCGCGTCGAG TCCGCCAACC AGCCCAATCC TGGCGGCGCC ACGACCGGCC TGACGATGCA GTGCTCCAAT CCCTATCTGC CGGCGTCCGT CGTGGCCGCC TGCGCCGCCA ACAACATCAC CAGCTTCACC TTCGGCACCA GCAACGCCCT GCTGCCCAAC AATATCTCGG TCCATCCCAC GCGGACGCAA TATCGCGGCG TGATCGGCGC AGACGGCAAG TTCGCCGCCT TCGGCACGGA TTGGCACTAC GACGCCTATT ACACGCACGG CGAGAACACC ACGAACATCC ACGTCAACAA CATCATGCTG ACCCCGCGCT ACAGGGCGGC CATCCAGGCC ACCCTGGTCA ATGGGCAGAT CGTCTGCGCC GACCCCGTGG CGCGGGCTAA TGGGTGCCAG CCGATCAACG TCTTCGGCGG GCAGCGACCG AGCGACGCGG CGCTGAAGTA CATCGAGCCC GAGAACGGTC CGTATCAGCA TTCGGTGCAG AAGCAGGACG TCGCCAGCAT CAACTTCAGC GGCGAGCCCA TCCAGGGCTG GGCCGGCCCG GTCGCCTTGG CCTTCGGCGC GGAATATCGC CGCGAGGCCT ACCACGTCCA GGGCGACTCC TACGGCGACG GCTCGGCGGC CTCGCCCTAC ACCACCGACT ATCCGGCCGA CCCGTTGCTC AACACGGCGG GCAACAACTG GTACGCGGGC AACTATCACT CGGGCGCCGG CAAATACACC GTCAAGGAGG CCTATGCGGA AGTGAACCTG CCGCTGGTCA ATTCGGACGC CGCCGGCAAG GCCAACCTGA ACTTCGCCGG TCGCTGGACC GACTACAGCA CCTCGGGCAC CGTCTATACC TGGAAGGTCG GCGGAACCTG GGACACGCCG ATCGATGGCG TCCGCCTGCG GGCGGTGACC TCGCGCGACG TGCGCGCGCC GAACCTGTCG GAACTCTACG CCGCGCCGGT CACCACCACC CTGCCCAACT TCACCATTCC TTCGAACGGC ACGCCCCCGC CGGGCCAACC CTCGGGTCCC GGTGCGCTGC TGGTCCTCCA GAACGTGGTC GGCAACCCGG ACTTGAAGCC CGAGATCGCC AAGAACACGA GCCTCGGCGT GGTGCTGGCG AACCCGAAAT GGCTGCCGGG CTTCAGCGTC TCCGTGGACT GGTACAACAT CGTGCTGAAT GGCGGTATTT CCAGCCTCAG CGCTCAGCAG GTGGTCAACT TCTGCTATTC GGGCCTGACG CAGTATTGCG GCAGCTTCAA CTTCGCCCCG CCGGCCGGCA CGACCCCCTA CGTCAACGCC CAGGTGTTCA ACCTGGCCTC GATCAAGACC AGCGGCTTCG ACATCGAGAC CAGCTACCGG TTCGAGATTC CCAGCGTTCC GGGCCGCTTC GACCTGCGCG CCCTGGCGAC CAACACCCAC AAGTTCGTCA CCAATCAGGG CTTCCCCGGC GGCTCCGTTG TCGATACCGC GGGCCAGAAT TCCGGGGCCA CGCCAGACTG GAAGGTGCTC GCCGTCCAGA GCTGGAGCTC GGACCGGTTC ATGCTCAGCC TGCAGGAACG CTGGTTCAGC GATGGCGTCA TCGGCGGCCA GTACATCGAG TGCGCCGCCG GCAGCTGCCC GGTGGGTAGA ACCGCAGCCG ATAACAACAA CTACCCGACC ATCGACCACA ACCAGATGAA GGGCGCCACC TATGTCGACG TGAGCGGCTC CTACAAGGTG ACCAAGAGCC TCCAGGCCTA TTTCAAGGTC GCCAACCTCT TCAACAAGGA CCCGACGCCG TCGCCGCAAA CCAACACCGG TCTTGACGCC AATCCGGCGC TCTACGACCT GCTGGGGCGG TTCTACCACG TGGGCCTGCG CTACAGCTTC TAG
|
Protein sequence | MSALGRTRRG SPFLNVLFAT SAVAMIVSAG SAVAQTAPPP AEPDATALTE IVVTGTRIRS TGFTAPTPTQ VLGQADLERN AEPNVFTTIA QLPSLQGSTG ATTGTFSTSS GQQGLSSFSL RGLGTIRTLT LLDGQRVVPA NITGVPDISL FPQLLVERVD VVTGGASASY GSDAVGGVVN FITNKRFEGF KANVSAGVTT YGDNEQFLLQ VAGGKAFMND RLHVQVSGEY DEEQGVPAGG FGEDAPGSRD WYTTATLVNR GVTNDGSPQY LYREHAQAYQ YTKYGLISAG PLQGTAFDQA GNPFQFQYGS NGVPSKAANG AVIGCYSNGG FCVGGDLSGN VGVGTSLQSE ITRMNSYGRV AYDIDDNNEI YGTLNIARVE SANQPNPGGA TTGLTMQCSN PYLPASVVAA CAANNITSFT FGTSNALLPN NISVHPTRTQ YRGVIGADGK FAAFGTDWHY DAYYTHGENT TNIHVNNIML TPRYRAAIQA TLVNGQIVCA DPVARANGCQ PINVFGGQRP SDAALKYIEP ENGPYQHSVQ KQDVASINFS GEPIQGWAGP VALAFGAEYR REAYHVQGDS YGDGSAASPY TTDYPADPLL NTAGNNWYAG NYHSGAGKYT VKEAYAEVNL PLVNSDAAGK ANLNFAGRWT DYSTSGTVYT WKVGGTWDTP IDGVRLRAVT SRDVRAPNLS ELYAAPVTTT LPNFTIPSNG TPPPGQPSGP GALLVLQNVV GNPDLKPEIA KNTSLGVVLA NPKWLPGFSV SVDWYNIVLN GGISSLSAQQ VVNFCYSGLT QYCGSFNFAP PAGTTPYVNA QVFNLASIKT SGFDIETSYR FEIPSVPGRF DLRALATNTH KFVTNQGFPG GSVVDTAGQN SGATPDWKVL AVQSWSSDRF MLSLQERWFS DGVIGGQYIE CAAGSCPVGR TAADNNNYPT IDHNQMKGAT YVDVSGSYKV TKSLQAYFKV ANLFNKDPTP SPQTNTGLDA NPALYDLLGR FYHVGLRYSF
|
| |