Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4698 |
Symbol | |
ID | 5902160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5081267 |
End bp | 5083753 |
Gene Length | 2487 bp |
Protein Length | 828 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641565217 |
Product | TonB-dependent receptor |
Protein accession | YP_001686316 |
Protein GI | 167648653 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.648479 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACGA AAAAGCTGGC CTGTCTGGCG TCGACCGCTC TGGTCGGCTG CCTGCTGAGC GCCACCGCCG CCATGGCCCA ATCGACCGGC TCGCAAGCCA ACGAAGTCGA TAGCGTGGTT GTAACCGCCG CCGGAGCCCG CGCCGTCGCC GGCCAGATCG TAGAGACGTT GCCGAAGTCG CGCGCCTCGG TCGACGCCGC GTTCCTGGCG ACGCAATCCA CCGGCCAGAA CGTGTTCCAG TCGCTGAACC TGTTGCCGGG CGTCAGCTTC ACCAACAACG ACCCCTACGG TTCGTCGGGC GGCAACCTGC GCCTGCGTGG CTTCGACGGC GCTCGCGTCT CGGTCACGTT CGATGGCGTT CCGCTGAACG ACACCGGTAA CTACGCCGTC TATCCGAACC AACAGCTGGA CGCCGAACTG ATCGACCGCG CCAGCGTGAA TCTGGGCACG ACCGACGTCG ACAGCCCGAC CGCGTCGGCC ACCGGCGGCA CCATCAACTA CATTACCCGC AAGCCGGCCC ATGAGTTCGG CGGCATCGCC GACGCCTCGA TCGGCGAAGA TAACTATCGC CGCGGCTTCC TGATGGTAGA CACGGGCGAT ATCGGTCCGT TCGGTACCCG CGCCTTCGTG GCCGGCTCCT ACCAGAAGTA CGACAAGTGG AAGGGTCCGG GCTCGCTCGA GAAAAAGCAA GTCAACGCCC GGATCATGCA GGACATCGGC AATCGCGGCG ACTTCGTCAG CCTCGCGGTG AACTACAATG AGAACCGCAA CAACAACATC CGTCAGCTGT CGCTGTCCGA CTTCCGCACC TTCGGCAAGA ACTACGACTA CGACGCGGTC TGTAACCCGG CCGCCGTCAG TGGCGTCGTC GTTCCCGGCA ACTGCACCAA CTACTACGGC CGTCAGGTCA ACCCGTCGAA CACCGGCAGC ATCCGCGGCT CGGCCCTGTT CCATCTGGCC GACAACATCC GTCTGACGAT CGACCCGTCG TTCCAATACA CCCTGGCCGA CGGCGGCTCG CAGCTGGCCA CCGTCGCTGA AACAGATGGC CGGGTCCGTG GGACCGTCGC CGGCGCGCAA GGCCCGGCTA AGGACCTGAA CGGCGACGGC GACGCCGTGG ACACCGTGGC GTTCTTCGCG CCCAGCGTCA CCAACACCCG CCGCTACACC GTCACCAGCT CGCTGATCTG GGACCTGAAT GACGACAACC GCGTCCGCGT CGCCTACACC GGCGACTACG GCCGTCACCG CCAGACCGGC GAGTACACGA CGCTCGACTC GCAAGGCAAC ACGACCGACG TGTTCGGCGG CAAGGAAGGC CACGGTGCCA AGGTCCTGAC CTCGGACGGC AGCTTCTTGC GCGCTCGTGA CCGTTTCTCG ATCGCCCAGC TGAACCAGGT TGCCGCCGAA TACCGCGGCA AGTTCATGGA CAGCCGCCTG ACCGTGAACC TCGGCATCCG CGCGCCCTTC TTCAAGCGCG AGCTGAACCA GTACTGCTAC ACGCAGAACG CCACCGGCCT GACGACTGTG GTTTCAGGCT TCACGGTGCT CTGCACCACC CAGACCCCGG TCGTCACCAA CGCCGACGGC ACGGTGCAGT TCGCCCCCAA CGGCACGACC ACCGGCGCCG CCCTCGCCAA CCTGCGCTAC ATCAAGCCGT GGTCGGCCAC CGTGAAGTAC GACAAGGTCC TGCCGAACGC CGGCGCGACC TATGACATCG GCGGCGGCAG CACCGTCTAT GTCAGCTACG CCGAGGGCTT CTCCTCGCCG CGCACCGACA ACCTCTACAC TGCAACCCTG GAGAACAAGG TCGGCACTCC GGCCGACACC CGTCCGGAAA CCACCAAGAC CTACGACCTG GGCTATCGCT TCGCGAGCCC GACGGTCATG GCCACGGCCG CGGTCTGGAA GACGGATTAC AAGAACCGCA TCGTGCAAGC CTATGATCCG GATCTGAACA TCAGCATCGA CCGCAACGTC GGCGCGGTTA AGGCCTACGG CCTCGACACC CAGGCCGCCT GGGCCGTGGC CGAATACCTG ACCGTCACGG GTTCGTTCTC GTACAACAAG AGCGAAATCC AGCAAGATCT GCAGGTCAAC GCCGCCGGCG CCACCATCCC GCTGTCGGGC AAGCAGGTCG TCGAAACCCC GAAGTACACC TTCGGCGGCC GCGTCGACTG GGACGTCACC GAAGCGCTGC ACCTGGGCGT CCAGGGCAAG TACACCGGCG ACCGTTTCTC GACGGACGTG AACGACGAAG TGGCTCCGCA CTACACCGTG TGGGACATGT CGCTGGAGTA CGACCTGCCG TTCGCCAAGA AGACCTACGC CCAGCTGAAC GTGAACAACC TGTTCAACGA AACCTACTTC GGTTCGATCA GCTCGCGTAC GAACGCCTTG GCGCTTACGG GGTCGTCGGC AAGCGCGCCA ACCTACTACA TCGGCTCGCC GCGCACGGTT CAGTTCACCC TGCGCACCGA GTTCTAG
|
Protein sequence | MMTKKLACLA STALVGCLLS ATAAMAQSTG SQANEVDSVV VTAAGARAVA GQIVETLPKS RASVDAAFLA TQSTGQNVFQ SLNLLPGVSF TNNDPYGSSG GNLRLRGFDG ARVSVTFDGV PLNDTGNYAV YPNQQLDAEL IDRASVNLGT TDVDSPTASA TGGTINYITR KPAHEFGGIA DASIGEDNYR RGFLMVDTGD IGPFGTRAFV AGSYQKYDKW KGPGSLEKKQ VNARIMQDIG NRGDFVSLAV NYNENRNNNI RQLSLSDFRT FGKNYDYDAV CNPAAVSGVV VPGNCTNYYG RQVNPSNTGS IRGSALFHLA DNIRLTIDPS FQYTLADGGS QLATVAETDG RVRGTVAGAQ GPAKDLNGDG DAVDTVAFFA PSVTNTRRYT VTSSLIWDLN DDNRVRVAYT GDYGRHRQTG EYTTLDSQGN TTDVFGGKEG HGAKVLTSDG SFLRARDRFS IAQLNQVAAE YRGKFMDSRL TVNLGIRAPF FKRELNQYCY TQNATGLTTV VSGFTVLCTT QTPVVTNADG TVQFAPNGTT TGAALANLRY IKPWSATVKY DKVLPNAGAT YDIGGGSTVY VSYAEGFSSP RTDNLYTATL ENKVGTPADT RPETTKTYDL GYRFASPTVM ATAAVWKTDY KNRIVQAYDP DLNISIDRNV GAVKAYGLDT QAAWAVAEYL TVTGSFSYNK SEIQQDLQVN AAGATIPLSG KQVVETPKYT FGGRVDWDVT EALHLGVQGK YTGDRFSTDV NDEVAPHYTV WDMSLEYDLP FAKKTYAQLN VNNLFNETYF GSISSRTNAL ALTGSSASAP TYYIGSPRTV QFTLRTEF
|
| |