Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4082 |
Symbol | |
ID | 5901544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4424137 |
End bp | 4426851 |
Gene Length | 2715 bp |
Protein Length | 904 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641564602 |
Product | TonB-dependent receptor |
Protein accession | YP_001685704 |
Protein GI | 167648041 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTCTC ACTACAAGCT ACTCGCCGCG GTCAGTGGAC TGGCCCTGAT GGCGGCCGCC GGCGCCCACG CGCAAACGGC CGCGCCCGCC AACGACGCCG CGCAGGTCGA CGAAGTCGTC GTCACCGGCG TCCGGAAGAG CCTGCGCGAC GCCCTGCAAG TGAAGCAGGG CTCGGACAAG GTGGTCGAGG CCATCTCGGC CAAGGACATC GGCGTGCTGC CCGACGTCAC CATCGCCGAA TCCATCGCCC GCCTGCCCGG CGTCAACGCC ACCCGCGACC GCGGCAACGA CAGCCAGGCC GTCGTGCGCG GCCTGGGCGC GCGCCTGGTG CTGGGCACCA TCAACGGCCG CGAAGTCGCC TCGTCAGAGC CCGACCGCAA CGTGCGCTGG GAAATCTACC CTTCCGAAGT CGTCCAGGGC GTCCAGGTCT ACAAGTCGCA GTCGGCCGAC CTGATCGCCG GTGGCGTGGC CGCGACCATC AACATCGACA CCATCGCCCC GCTCGACTAT CGCGGTCCCA GCGTCGTGCT GCGCGCCGGC CCAGTCTATT ATGACGGCGG CAAGGACATC CCCAACTACG GCCAGACCGG CTACCGGGCC AGCGGCTCGT TCGTGCACAA GTTCAACGAC GACCTGGCCA TCGTGCTGGG CCTGACCAGC CAGAAGCAGA AGAACGGCTA CACCTCGTTC CAGGGCTGGG GCTACAACGA CTCGGTGATG CGCCAGCCCG CTGGGGCTAG CGACTACAGC GGCGACCTGA ACGGCGACGG CAAGGTCGAT CCCACCCCGT GGGGCATGCA GCTGGAGATC AAGAAGATCG ACCAGAAGCG CAACGGCGTG TCGACCGGCC TGCAATGGAA GCCGACCGAC CACTTCGAGC TGAAGGCCGA CGTCCTCTAT TCCGACATCA AGATCACGGA AAACCAGGAC CAGCAGATCT ACGCCCAGAA CTACGGCAAC TGGAACAACG GCAATGCCTT CGACTGGCAG GGTAATCCCA TCGGCTACAA CGCCCCGGGC GCGTCCTACA CCCTGGTCAA CGGCGACGTG GTCGCCGCCA CCCTGCCCGG CGCCGCCGTC ACCTCGGTGA TCGCGCGCTA TACCGAGGAC AAGAAGCTCT ATGCCGGCGG CCTGAACGGC AAGTGGACCA ACGACGCCTG GACCGTGGCC GGCGATGTCT CCTATTCGAA GGCCGAGCGG ACCAACAACT GGCGGGCTGT GCGCGCCGAG GTCTATCCGG CCTGGATGAC CTATGACACC CGCGCCGGCG TCAAGCCCAG CGTCACCACC TCGGAAGACC CAACCACCCT CGCCCAGGTG GCCCCCAGCT GGCGCGGCGG CCAGAACGAC GGGCTTGAGC ACCTGAACGA TGAGTTGAAG GCCGGCGCCC TGGACTTCAC CCGCGACTTC GGCGGCGGGG CGTTCAAGAG CTTCCAGTTC GGCGCGCGCT ATTCAGACCG GGTGAAGGAT CACGACCAGG CCAGTTGGTC CACCTGCCCC AACCCGACCA ACACGGCCAA CCTGGCCGGC CTGAAGGACC AGTGGGGCAA CTGCCTCTAC TCGGTCACCC TGCCGGCCAG CCTGTTCAGC ACCTACAAGA TCGGCAGCTT CAACGTGCCG AGCATTTTGA CCGGCGACCT GGACGCCATC GCCAAGGCCG CCTACGGCGA CCACGGCTTC GACGCCGCCA ACGCTGTCGA CAACCTGGCC CAGCGCTGGC GCGTCCACGA GAAGGTGGCC GAGGCCTATG GCAAGCTGAA CTTCGCCGCC GACGGCGTCG CCGGCGCCTG GATGACCGGC AATGTCGGCG TCCGCGTGGT CAGCACCAAG ACCGACAGCG AGGGCTATCG CCAGGACCCG GGCCTGGCGA CCTTCTCGGC CGTCTCGGTC AAGGCTGACT ATACCGACGT GCTGCCCAGC GCGAACGTCA AGCTGGACTT CGACCAGGGC CGCGTGCTGC GCTTCGGCCT GGCCCAGGTG GTGGCCCGCC CGCCGCTCGA CGAGCTGCGC GCCAGCCGCA CCCTGACCAC CTGGTCGCCC TATACCGGCT CGGCCGGCAA CCCGAACCTC AAGCCGTTCA AGGCCATCCA GTTCGACGCC TCGGCCGAGT GGTACTTCCG TCCCGAGAGC CTGGTGGCCG CGTCCTACTA CTACAAGGAC GTCGATACCT ATATCGGCTG GAAGCAGACG CCCGAGACCT ACAACGGGAT CACCTACGCG GTGTCGAGCC CGGTCAATGG CGGCGGCGGC TACATCCAGG GCCTGGAGCT GACCTTCCAG ACGCCGTTCT TCTTCCTGCC GGGACCGCTG AGCAAGTTCG GGATCTATTC GAACTACGCC TATGTCGACT CCGACCTGAA GGAGTTCCAG CCGGTCACCA AGCCGCTGTC CCTGACGGGC CTGGCCAAGG ACACCGTGAC CCTGGACCTG TGGTACGCCA ACGGCCCGAT CGAAGGCCGC ATCGGCTACA AGTACCACAG TCCGATGACC GTGATTTACG GCTGGAGCGG CGCGGACCTG CAGACCCTGG AGTCGGCAAG CACGGTCGAC TTCAGCTCGT CCTACCAGGT CACCGACAAG ATCGGCCTGC GCTTCCAGGT CAACAACCTG ACCAATGAGC GCCTGCGGAT GTATCGCGAC AACAAGCCCG ACCGCCTGGG TCGCTACGAC CTTTACGGCC GCCGCTTCCT GTTCGACGTG ACGGTGAAGT TCTAG
|
Protein sequence | MMSHYKLLAA VSGLALMAAA GAHAQTAAPA NDAAQVDEVV VTGVRKSLRD ALQVKQGSDK VVEAISAKDI GVLPDVTIAE SIARLPGVNA TRDRGNDSQA VVRGLGARLV LGTINGREVA SSEPDRNVRW EIYPSEVVQG VQVYKSQSAD LIAGGVAATI NIDTIAPLDY RGPSVVLRAG PVYYDGGKDI PNYGQTGYRA SGSFVHKFND DLAIVLGLTS QKQKNGYTSF QGWGYNDSVM RQPAGASDYS GDLNGDGKVD PTPWGMQLEI KKIDQKRNGV STGLQWKPTD HFELKADVLY SDIKITENQD QQIYAQNYGN WNNGNAFDWQ GNPIGYNAPG ASYTLVNGDV VAATLPGAAV TSVIARYTED KKLYAGGLNG KWTNDAWTVA GDVSYSKAER TNNWRAVRAE VYPAWMTYDT RAGVKPSVTT SEDPTTLAQV APSWRGGQND GLEHLNDELK AGALDFTRDF GGGAFKSFQF GARYSDRVKD HDQASWSTCP NPTNTANLAG LKDQWGNCLY SVTLPASLFS TYKIGSFNVP SILTGDLDAI AKAAYGDHGF DAANAVDNLA QRWRVHEKVA EAYGKLNFAA DGVAGAWMTG NVGVRVVSTK TDSEGYRQDP GLATFSAVSV KADYTDVLPS ANVKLDFDQG RVLRFGLAQV VARPPLDELR ASRTLTTWSP YTGSAGNPNL KPFKAIQFDA SAEWYFRPES LVAASYYYKD VDTYIGWKQT PETYNGITYA VSSPVNGGGG YIQGLELTFQ TPFFFLPGPL SKFGIYSNYA YVDSDLKEFQ PVTKPLSLTG LAKDTVTLDL WYANGPIEGR IGYKYHSPMT VIYGWSGADL QTLESASTVD FSSSYQVTDK IGLRFQVNNL TNERLRMYRD NKPDRLGRYD LYGRRFLFDV TVKF
|
| |