Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1843 |
Symbol | |
ID | 5899298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1960245 |
End bp | 1963496 |
Gene Length | 3252 bp |
Protein Length | 1083 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641562333 |
Product | TonB-dependent receptor |
Protein accession | YP_001683470 |
Protein GI | 167645807 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000198025 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000327731 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGACTATG TCCAAGGCGC GTCCGCGCGT GCACGTTCAA CTTACAATTC AAGACTCAGG TACGGTGTTT CGGCCCTCGC GCTCGGCGCC GTGCTGATCG GAGCGCCGGC CCTGGCTCAG ACCAAGCCGA CCGATGACGC GCAGACGGTC GACGAGGTTA TCGTCACCAG CATCCGCCAG AGCCTGAAGA GCTCGCAGCA GCTCAAGCAG AGCTCCGAGA TCATCGGCGA CTCGATCACC GCCGAGGACA TCGGCGCCCT GCCGGACCGT TCGGTCACCG AGGCGCTGCA ACGCATCCCG GGCGTGGCCA TCAACCGCTT CGCCGCGGGC GTCGATCCCG ACCACTTCTC GGCCGAAGGC AGCGGCGTCG TCGTGCGCGG CCTGAACTTC GTGCGTTCCG AGCTGAACGG CCGCGACACC TTCTCGGCCA ACAACGGCCG GGCCCTGAGC TTCGCGGACG TGCCGTCGGA GCTGATGGGC GGCGTCGACG TGTTCAAGAG CCCCTCAGCC GACATGATCG AAGGCGGCAT CTCCGGCACC GTCAACCTGC GCACCCGTCT GCCGTTCGAC AGCAAGAAGC GTCTGTTGTC GCTGTCGGCT GAAGAGAGCT ACGGCGATTT CGTCAAGAAG TGGGCGCCCA CCTATTCGGC GCTCTACAGC GACCAGTGGG ATACCGAGGC CGGTACGTTC GGCCTGCTGC TCAGCGCCGT CGACTCCAAG CTGTGGACCC GTTCGGACGG CACCCAGGTG TCGAACTTCG GTTGCCGCAC CAACTTCACC AGCGCCCAGA CCGCCAATCC GCAGGCCGTC ACCTGCCCGC AAGGCGGCAA GGGCGTGTGG TTCCCGCGCG GCGCGGCCTT CCGCAGCACC GAGACCGAGC GCGAGCGCAT CGGTTACGCG GCGGCCGGCC AATGGCGCAG CAACGACGAC ACCATGCTGG CCACCTTCCA GTACCTGCGT TCGGAATCGC AGCAGTCCTG GACCGAGCAC GCGATGGAAA TCGCCACCGA CAACGTCCTG GCGGCGGGCG ATTCGCGTCC GATCGACGGC ACCACCTTCG GCGTTGACAG CAACGGCATC TTCACCAACG GGATCATCAC CGGTCCCCAG GGTTGGAGAG ACGACCAGAA CAGCGCCGAC CCGCGCACGC CCAGCTTCGG CCTGCAGAGC AACAACATCT CTCGCAGCGT CGAGCAGAAG TATGTGACCT CGGACTACGG CTTCAACTTC AAGTGGACGC CGACTGATCG CATCGGGGTG GCCTTCGACT ACCAGCACGT CGACTCGACG GTCGACAACC TGGACGTCGG CATCTGGGGT TCGAGCTTCC AGAACCTGGA TCTGAAGCTC AACGGCTCCG ACATGCCGGT GTTCAGCTTC ATCCCGCCGG CCAGCGGCGC GTCGATCCCG CAATGCTCAC CGCCCAGCGG CAGTTGCTCG ACCTACCTCC GCGCGCCGTA CGACCACTTC CAGAACCCGC ATAACAGCTT CTGGCGTTCG GCGATGGACC ACATCGAGCA GAGCGAAGGC AAGGAAGACG CCGCCAAGAT CGATGTCGAC TACCGTTTCG CGGACGACAG CAGCTGGCTG GATTCGGCCC GCGTGGGCGT GCGCTGGGCC GAGCGCGACC AGACCACCCG CTTCTCGACC TATAACTGGG GCGTGCTGAG CGAAATCTGG GGCGGCGGCG GTCCGGTGTG GTTCGACGAT CCGGTCAACG GCAATCCGGC TACGGCAGGC GGCGAGACCA GCGTGGCGCG CACCGAACTC TATCCGTTCA CCGACTTCAT GCGCGGTCAG GTCCCAGCTC CGACCGGCCT GGACGCCCGT CCGTTCTACC TCGGCAACAC CGCCACGGAC TATGCGGGTC TCCAGGCGTT CGCCCTGAAG ATCGGCGACG AGTGGCGCCC GCGCGTCGCG GCGGGCTCAA CCTGCCCGCA GAACTGGGTC CCTCTGGCCC AGCGCTGCAA TACGGTCGCC GGAACGCCTT TCCTGCCGGG TGAAATCAAC CCGATCAACG AGAAGACCAA GTCGGCCTAC GCCATGCTGC GCTTCAAGCA TGAGTTCGAC GGCGACGTTA AGGTCACGGG CAATATCGGC CTGCGCTACA CCAGCACCAC CCGGGATGCG ACAGGCTTCC TGACCTTCCC GAACACGGTC CCGGCGACCG ATGCCTCGTG CGATCTCTCC TTCACGAACT GGCAGGCCCA GCCGGATCCC AAGGATCCCT TCGTGCCCTC GGCGTTCTGC GCCCTTTCGC CCACCGCTCG CCAGAGCGTG CGCAACTTCA ACAACGGCGC CACGGTCGCG CAATCCGCCC ACGCCAAGTT CACCTACTGG CTGCCCAGCG TGAACCTGAA GGTGGCCTTG AGCGACGGCT GGCAACTGCG CTTCGCCGCC TCCAAGACGA TCACCCCGCC GGAAGTCGGG CTGACGCGCA ACTATTACGA CGTCAAGCTC GACACTAACT CGACGGGCAT CATCAACGGC GTGGTCGGCG GCAACACCAC GGTCGGCAAC CCGTATCTGA AGCCCACGCA GTCGATCAAT ATCGACGGCT CGGCGGAATG GTACTTCGCG CCGGTCGGCT CGGTGACCCT CGCTCTGTTC TGGAAAGAAC TGACCGATGT GGCCACCAAC ACCACCGCGC GGATCCCGTT CACCAACAAC GGTTCGACCT TCGCCGTCGC GGTGACCACG CCCGGGAATT CGGACGTCAA GGCCCACGTC AAGGGCTTCG AGATCGCCTA CCAGCAGTTC TACGACTTCC TGCCCAAGCC GTTCGATGGG TTCGGCATCA ACGCCAACTA CGCCTATATC GACAGCAAGG GCGTGCCGCA AAGCACGCTG TCGGCCACCG ACCCCGACGT GGGCGCTGGC CGGGTCTCGA CCGTCGACAC TGGCCTCTTG CCGCTGCAAG GCCTGTCCAA GCACAACGTC AACTTCGCCG CCATCTATGA GAAGGGTCCG ATCTCGGCCC GTCTGGCCTA CAATTGGCGT TCGGACTTTC TGTTGACTGT CCGTGACGTG ATCGTGCCCT TCGCACCGAT CATGAACGAG GCTACCGGCC AGTTGGACGG ATCGCTGTTC TACACGATCA ATCCGAAGGT GAAGATCGGC GTGCAGGGCG TGAACCTGAC CAACGAGGTC ATCAAGACCA CCCAAGTGTT GAACAACGAC CTGCTCAAGG CCGGGCGGTC GTGGTTCATG AGCGACCGTC GCTACACCTT CGTGTTGCGC GCCAGCTTCT AG
|
Protein sequence | MDYVQGASAR ARSTYNSRLR YGVSALALGA VLIGAPALAQ TKPTDDAQTV DEVIVTSIRQ SLKSSQQLKQ SSEIIGDSIT AEDIGALPDR SVTEALQRIP GVAINRFAAG VDPDHFSAEG SGVVVRGLNF VRSELNGRDT FSANNGRALS FADVPSELMG GVDVFKSPSA DMIEGGISGT VNLRTRLPFD SKKRLLSLSA EESYGDFVKK WAPTYSALYS DQWDTEAGTF GLLLSAVDSK LWTRSDGTQV SNFGCRTNFT SAQTANPQAV TCPQGGKGVW FPRGAAFRST ETERERIGYA AAGQWRSNDD TMLATFQYLR SESQQSWTEH AMEIATDNVL AAGDSRPIDG TTFGVDSNGI FTNGIITGPQ GWRDDQNSAD PRTPSFGLQS NNISRSVEQK YVTSDYGFNF KWTPTDRIGV AFDYQHVDST VDNLDVGIWG SSFQNLDLKL NGSDMPVFSF IPPASGASIP QCSPPSGSCS TYLRAPYDHF QNPHNSFWRS AMDHIEQSEG KEDAAKIDVD YRFADDSSWL DSARVGVRWA ERDQTTRFST YNWGVLSEIW GGGGPVWFDD PVNGNPATAG GETSVARTEL YPFTDFMRGQ VPAPTGLDAR PFYLGNTATD YAGLQAFALK IGDEWRPRVA AGSTCPQNWV PLAQRCNTVA GTPFLPGEIN PINEKTKSAY AMLRFKHEFD GDVKVTGNIG LRYTSTTRDA TGFLTFPNTV PATDASCDLS FTNWQAQPDP KDPFVPSAFC ALSPTARQSV RNFNNGATVA QSAHAKFTYW LPSVNLKVAL SDGWQLRFAA SKTITPPEVG LTRNYYDVKL DTNSTGIING VVGGNTTVGN PYLKPTQSIN IDGSAEWYFA PVGSVTLALF WKELTDVATN TTARIPFTNN GSTFAVAVTT PGNSDVKAHV KGFEIAYQQF YDFLPKPFDG FGINANYAYI DSKGVPQSTL SATDPDVGAG RVSTVDTGLL PLQGLSKHNV NFAAIYEKGP ISARLAYNWR SDFLLTVRDV IVPFAPIMNE ATGQLDGSLF YTINPKVKIG VQGVNLTNEV IKTTQVLNND LLKAGRSWFM SDRRYTFVLR ASF
|
| |