Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3715 |
Symbol | |
ID | 5901171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4010073 |
End bp | 4012268 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641564226 |
Product | TonB-dependent receptor |
Protein accession | YP_001685340 |
Protein GI | 167647677 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.62728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCT CCAACAAGCT GCGCAGCCGC CTCCACCGCG GCGCCGCGCT CACCGTCCTG GCCCTCGCCG TCGGCGTCGC CGTCCCGGCC TTCGCCCAGG ATACGGGCAA CGTCACCCTG GATGACGTGA TCGTCACCGC CCAGAAGCGG TCGGAAAACG TCCAGGAAAT CCCCGTTTCG GTCGCGACCA TGTCCGGTGA AAAGCTGGGC GACGTGCTGG CCGCCGGCGA GGACATCGTG GGTCTGTCGA GCCGCGTGCC TGGCCTGTAC ATCGAATCCT CGAATGGCCG CGCCGCGCCG CGCTTCTACA TCCGGGGCCT CGGCAACGTC GACTTCGACC TGGCCGCCTC GCAGCCCGTT TCAGTGATCA TCGACGACGT GGTCATGGAA AACGTGGTGC TCAAGAGCAC CCCGATCTTC GACGTCCAGC AGGTCGAAAT CCTGCGCGGT CCGCAAGGCA CGCTGTTTGG CCGCAACACC ACCGCCGGCA TCGTCAAGTT CGACTCGGTC AAGCCGAGCC AGGACTTCTC GGCAACCGGC ACGGCCACCT ACGGCACCTA TGGCACCGCC ACGTTCGACG GCGGAGTCGG CGGCGCGTTG GTCAAGGACG TCCTGTCCGG CCGCCTGTCG GTGTTGGCCC AGCACCGCAA CGACTACATC GACAATGGCT TCAATGGCGA AAAAGACGCC TTGGGCGGCT ATGACGAATA CGCCATCCGC GGCCAGTTGC TCTATACGCC GACCGACAAG TTCAGCGCCC TGCTGAACCT GCACAACCGC TCGCTGGACG GCACCGCCGC GATCTTCCGC GCCAACATCC TCACCACCGG CAGCAACAAG ATCAACGGCA ACTTCAAACG CGACAAGGTC TTCTACAACG GCGGCAACAA CAACCCGCAG AAGTTCGACG GCAACGGCGC GTCGCTGAAG ATGGACTACG ATCTGGGCGG CGCCAAGCTG ACCTCGATCT CGGCCTACGA GACCACCAAC GGCTATAGCC GCGGCGACAT CGACGGCGGC GTGGCCGGGG TCGGCCCGGG CTTCATCCCG TTCGACTCCG CCTCGGCGGA CGCCATCGAC CTGGACCAGT ACACGCAGGA AATCCGCCTC GCCAGCGACG ACGCTTCGCC GCTGACCTGG CAGGTGGGCG CCTACTACTT CAAGTCCAAG TTCTCGGTGG CCAGCGATCC CGGCTTCGCG CCGCCCTCGA CCATCGAGCA CAAGAACACG GCTTGGGCGG TGTTCGGCCA AGCCTCTTAC AAGATCTCTG ACGACCTGAA GATCACCGGT GGCCTGCGCT ACACCAGCGA CGACAAGGAC ATGTCGGTGT TGAGCTCGCC GTTCGGCATC CCCGCGCCGG TCTCGGTGTC GGACGAAAAG GTCAGCTGGG ACCTGTCGGC CTTTTACGAC GTCGCGCCCG ACGTCAGCCT CTATGCCAAG GTCGCCTCCG GCTTCCGCGG TCCGTCGATC CAGGGCCGCG ACATCGCCTT CGGCAGCGCC TCCTCGATCG CCCAGTCGGA AACGATCATG TCGTACGAGG CGGGCCTGAA GAGCGAGCTG CTCGACCGCC GCGTGCGCCT GAACGGCGCG GTGTTCGCCT ATGAGGTCAA GGACCTCCAG CTCAGCGCCA TCGGCGGCGG CAGCAACTCC AACCGCCTGA TCAACGCCGA CAAGGGCCAA GCCTACGGCT TCGAACTCGA CGGCGAATGG GCGGTCAACG AGAACCTGCT GGTGACGGCC GGCTACAGCT ACAACCACAC TGAAATCAAG GACAGCGGGC TGACCACCGC CGCCTGCGGT TCGGGTCAGT GCACGGTGAC CGACCCGACC ACCACCGGCG GCCTGGCCCT GATCAACGGC AACCCGTTCC CGAACGCGCC GAAGTCGATC CTGACCTTCA CGGCCCGCTA CAGCTATCCG ATCGGCGACG GCGAACTGTT CGCCTACACC GACTGGTTCC GTCAGGGCTA CACCAACATC TTCCTGTACG AGAGCAAGGA GTACCACACC AACGGCGACT TCGAAGGCGG CCTGAAGCTG GGCTACGCCA AGTCCGACGG TGCGTATGAA GTGGCGCTGT TCGCCCGCAA CATCACCAAC GAGGTGAACC TGCGCGGCGG CATCGATTTC GACAACAACA CCGGCTTCGT CAACGAGCCC CGGATCGTCG GCATCTCGAT CAGCGCCAAG CGGTAA
|
Protein sequence | MTTSNKLRSR LHRGAALTVL ALAVGVAVPA FAQDTGNVTL DDVIVTAQKR SENVQEIPVS VATMSGEKLG DVLAAGEDIV GLSSRVPGLY IESSNGRAAP RFYIRGLGNV DFDLAASQPV SVIIDDVVME NVVLKSTPIF DVQQVEILRG PQGTLFGRNT TAGIVKFDSV KPSQDFSATG TATYGTYGTA TFDGGVGGAL VKDVLSGRLS VLAQHRNDYI DNGFNGEKDA LGGYDEYAIR GQLLYTPTDK FSALLNLHNR SLDGTAAIFR ANILTTGSNK INGNFKRDKV FYNGGNNNPQ KFDGNGASLK MDYDLGGAKL TSISAYETTN GYSRGDIDGG VAGVGPGFIP FDSASADAID LDQYTQEIRL ASDDASPLTW QVGAYYFKSK FSVASDPGFA PPSTIEHKNT AWAVFGQASY KISDDLKITG GLRYTSDDKD MSVLSSPFGI PAPVSVSDEK VSWDLSAFYD VAPDVSLYAK VASGFRGPSI QGRDIAFGSA SSIAQSETIM SYEAGLKSEL LDRRVRLNGA VFAYEVKDLQ LSAIGGGSNS NRLINADKGQ AYGFELDGEW AVNENLLVTA GYSYNHTEIK DSGLTTAACG SGQCTVTDPT TTGGLALING NPFPNAPKSI LTFTARYSYP IGDGELFAYT DWFRQGYTNI FLYESKEYHT NGDFEGGLKL GYAKSDGAYE VALFARNITN EVNLRGGIDF DNNTGFVNEP RIVGISISAK R
|
| |