Gene Information |
Locus tag | Caul_2243 |
Symbol | |
ID | 5899698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2437597 |
End bp | 2440599 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562734 |
Product | TonB-dependent receptor |
Protein accession | YP_001683868 |
Protein GI | 167646205 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | |
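The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(2437597, 2440599)
print(length)                     # 3003
print(protein_length_aa(length))  # 1000
```

Under these assumptions the results agree with the Gene Length (3003 bp) and Protein Length (1000 aa) fields.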
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.212076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCAG TTCTTGGGGC GCTTTTGATC GGCGCCAGCG TCGCGATCCT GGCGGCAGCC GCCACGCCAA CCAGCGCGAC GGCGGCAGAG CCGACCATAG CGCTCAATCT GCCCGCCGGC CCGATGCAGA AGTCTCTGGT CGCGCTCGCG ACCCAGGCCG ACGTCAAGAT CCTGTTCGAG ATGGATCTCG TCGCGGGGCT GACCGCGCCG GCGCTTCAGG GTCAGTTCAC GCCGCGCCAA GCGGTCGAAA GGCTGCTCGC TGGAAGCGGC GTCGCCGTCG ACCAAGTCCG ACCGGGCGTT TTGGTTCTGC GACCCGCGCG CTTGGGCGCG AGCGCAGAGG CTGCGGCTTT TCCGGGCGGC GCGCTCGGCG GGGAACCGGC GCAAGCCGAC GAAACCCTGC TGTCGGAGGT CGTGGTCGGC AGCCATATCC GTGGCGCTCA GGGCGCCTCG CCCATCGTCA CTTTCGACCG GAACGCTATT GATCAAGGCG GCTACGCGAC GCTCGCCGAC GCCTTGACGG CCCTGCCCCA GGCCTTCGGT GGCAGCGTGT CGGACGACAC CGGCGCGACG GGCGCCGACA CCACGGGCGT CAACACCGCC CGTGCGACCG CCGTCAATCT GCGGGGCCTT GGCGCGGACT CGACGCTGGT GCTGGTGGAC GGCCGGCGCA TGGCCGGCGC GGGACTGAAG GGCGACTTCG CCGACGTCTC CAGCCTGCCG CTGGTCGCGG TGGAACGCGT GGAGGTGCTG CTCGACGGAG CCTCGGCCCT CTATGGCTCT GACGCCGTCG GCGGTGTCGT CAACATCGTC ATGCGCAAGG ACTACGAAGG CGCCGAGACG CGGCTGACCG CCGGGGGGTC CACGCGCGGC GACCTGCGCC AGGTGTCGAT CGCCCAGACC TTCGGGACCC GCTGGGCAAG CGGCCATGCG TTGATCTCCT ACGAGCACCA GGACCGCGAG GCGTTGGCGG GACGCCGGCG CTGGTACGCC GGCCAAACGG ACCTTCGACC CTGGGGTGGA ACCGACCAGC GGCGGTACTA CGCCAAACCG GGGACCGTGG TGTCGTTCGA CCCGGTCAGC GGCGCCCTGG CGCCGGCCTA TGCGATCCCG AACAGCGCCC CGGGAACCGT GCTGCGCGCC AGCGACTTCA CCGCGGGGCA AAATCTCGAG AATTGGCGGG CCGGATACGA TGTCCTCCCG GCCCAACGTC GCGACAGCGT CTTTCTCGCC GCGAGCCAGG ATCTTGGCGC GCAGGTCACG GTCTCCGGCG ACCTGCGCTA CTCCGACCGG CGCTTTCACG CCACGGGCTT GGCGTCTGAT AGCCTGATCT TCGTCACCCC CGATAACCCC TGGTACGCCT CGCCGACGAA CGCTCCTTCT GAGATCGTCG CCTACTCCTT TCTCGACGAA TTGGGCGGCG TGCGCAGCCG CGGCTCGGTG CGCAGCCTCG CCGCCTCGGT CGGGCTTGAG GCACGCCTTC CACACGACTG GCGGCTGACG ACCTATGTCG CGCACGCCGA GGACCTGTCG CACACCCGCG GCGACAACGT CGTCAATCCC ACCCTGTTGG ACGAGGCCTT GGGGGCCACG CCCGACGATC CGGCCACGGC CTTCAGCGCC GCGCGCGACG GCTATTTCAA TCCCTTCATC GGCCAGGGCG CCAACAGCAG GACCGTGCTC GACTTCATCA GATCCGGCTA CGAGACGCGC CGCACGCTGG GGGAGACCGA CAGTTTCAGT CTGCAAGCGG ACGGCGCTCT GGCCACCCTG CCCGGCGGAC CTTTGCAGGC GGCCGTCGGC 
GTGCAATTTC GCCGCGAGCG TCTGGACACC GGCGGCACGA GCTTCGTCGG CGGAACCGCG CCGCGCGCCG GCTTTTCGCG AAAAGGCGAG CGCACCGTCA GCGCGGGTTT TGTCGAGCTG CGGGTTCCGC TGGTCGGCGA TGCAAACCGC CGGGCCGGCA TCGAGCGCCT GGAGCTTTCG GCCGCCGGGC GGATCGAGTC CTACGATGAC GTGGGGACCA GCACCGTTCC GAAGTTCGGC TTGGTGTGGA AGCCGATTGG CGACCTCACG GTTCGAGGCA CCTACGGCCG GGCCTTTCGC GCGCCCTCGC TGGGGGAACT AAACGACAGG TTCCTGATCA CCCCGGTGTT CCTGACGCGT GGCGCTGACA CCGTGCTCAG CCTGCTGCTG TTCGGGGGCA ATCCCGAGCT CAAACCCGAG ACCGCCAAGA CTTGGACGGC GGGCTTTGAC TGGACGCCAC AGGCCCTGCC CGGCCTGAAG GTGTCGGCGT CAACCTTCGA AACTCGCTTC AAGGACCGTA TCGGCCAACC GGCCAATGAT AATCTTGGCA TCGTGCTGAC GGCTGACGAG TTCGCCGCCT TTCGCCGGTT CGTCGATCCG GCGGGCAACG CCAGCGATTT GGCGCTCGTG CAAGGGCTGA TCGACGACCC GGCTTCGCGC GCCAAGGGCC TCTTTCCGGC CGTCGCCTAC GGCGCGATCG CCGACGCGCG CTATGTCAAC ACCGCCGCCC TCACGGTGCG GGGTGTCGAC CTGTCAGCGC GCTACGGCCT GAGCCTCCAC GGCGATCCTC TTGATCTGGA CGCCAGTCTG ACCTGGCTGA CGGATTTCAA GCGGCAGACA ACCGCCGCGG CGCGGCCCGT CGATCTCGCC GGGCAGACCG GATCCCCAGC CGATCTACGC CTTCGCCTCA CCGCGACCTG GACCCATGGC CCGCTGGCGG CCACCGGCAC GGTAAACCGG GTGGGCGATC TTCAAGCCGA AACCGGCGAA CGCGTGGCGT CCTGGACGAC GGTCGACGCC CAGGTCCGCT GGACCGCGGC TGCAAACAGC CGGCTGGAAG GCCTGACCGC CGCGCTGAGC GTCACCAACC TCTTCGACCG CGATCCGCCG TTCTACAACT CCCCTCTAGG CCTGGGCTAC GACCCGGCCA ACGCGGACCC GGGCGGCCGT CGAGTGAGCC TTCAGCTCAC CAAGGCCTGG TAG
|
Protein sequence | MKPVLGALLI GASVAILAAA ATPTSATAAE PTIALNLPAG PMQKSLVALA TQADVKILFE MDLVAGLTAP ALQGQFTPRQ AVERLLAGSG VAVDQVRPGV LVLRPARLGA SAEAAAFPGG ALGGEPAQAD ETLLSEVVVG SHIRGAQGAS PIVTFDRNAI DQGGYATLAD ALTALPQAFG GSVSDDTGAT GADTTGVNTA RATAVNLRGL GADSTLVLVD GRRMAGAGLK GDFADVSSLP LVAVERVEVL LDGASALYGS DAVGGVVNIV MRKDYEGAET RLTAGGSTRG DLRQVSIAQT FGTRWASGHA LISYEHQDRE ALAGRRRWYA GQTDLRPWGG TDQRRYYAKP GTVVSFDPVS GALAPAYAIP NSAPGTVLRA SDFTAGQNLE NWRAGYDVLP AQRRDSVFLA ASQDLGAQVT VSGDLRYSDR RFHATGLASD SLIFVTPDNP WYASPTNAPS EIVAYSFLDE LGGVRSRGSV RSLAASVGLE ARLPHDWRLT TYVAHAEDLS HTRGDNVVNP TLLDEALGAT PDDPATAFSA ARDGYFNPFI GQGANSRTVL DFIRSGYETR RTLGETDSFS LQADGALATL PGGPLQAAVG VQFRRERLDT GGTSFVGGTA PRAGFSRKGE RTVSAGFVEL RVPLVGDANR RAGIERLELS AAGRIESYDD VGTSTVPKFG LVWKPIGDLT VRGTYGRAFR APSLGELNDR FLITPVFLTR GADTVLSLLL FGGNPELKPE TAKTWTAGFD WTPQALPGLK VSASTFETRF KDRIGQPAND NLGIVLTADE FAAFRRFVDP AGNASDLALV QGLIDDPASR AKGLFPAVAY GAIADARYVN TAALTVRGVD LSARYGLSLH GDPLDLDASL TWLTDFKRQT TAAARPVDLA GQTGSPADLR LRLTATWTHG PLAATGTVNR VGDLQAETGE RVASWTTVDA QVRWTAAANS RLEGLTAALS VTNLFDRDPP FYNSPLGLGY DPANADPGGR RVSLQLTKAW
|
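The sequences above are printed in space-separated blocks of ten characters and may wrap across lines, so they need to be normalized before use. The following is a minimal sketch, assuming plain A/C/G/T/N nucleotide input; the helper names are hypothetical. It strips whitespace, computes the GC fraction, and checks for a terminal stop codon (TAA, TAG, or TGA under translation table 11).

```python
import re

# Stop codons in NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace and upper-case a block-formatted nucleotide sequence."""
    seq = re.sub(r"\s+", "", raw).upper()
    invalid = set(seq) - set("ACGTN")
    if invalid:
        raise ValueError(f"unexpected characters: {sorted(invalid)}")
    return seq


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# Only the first three blocks of the gene sequence are used here for brevity.
opening = clean_sequence("ATGAAGCCAG TTCTTGGGGC GCTTTTGATC")
print(len(opening))          # 30
print(gc_fraction(opening))  # 0.5
```

Applying `gc_fraction` to the complete cleaned gene sequence would give a value that can be compared with the GC content field of the record.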