Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4100 |
Symbol | |
ID | 5901562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4452896 |
End bp | 4455853 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641564620 |
Product | TonB-dependent receptor |
Protein accession | YP_001685722 |
Protein GI | 167648059 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCAAC GTACTCGTGC GCGCGTCTAT TGGGTCGGCG GCGCGTCAGC CGCCGCCGTC ATATTGTCAG CCGGCCTAGC GAACGCTCAG GCAAAACCGC CCACGGGCAA CATGGTCGAG GAAGTCGTCG TCACCGCCCA GCACAAGCAG GAGGCGGCCC AGACCGTGCC GATCGCCATC TCGGCCTTCT CGCAGAAGAC CCTGGACGCC GCCAAGATCG AAGGCGGCCC CGACCTGTTG AAGGCCATCC CCAATGTCAG CTTCACCAAG ACCAATTTCA GCGGCGTGAA CCTGACGATC CGCGGCATCG GCACCCAGGC GGTGTCGGTC TCCACCGATC CCGGTGTCTC GATCAATTTC AACGGCACGG CCTTCATCCG CAACCGCTTC TTCGAGCAGG AGTTCTTCGA CCTGGAGCGG GTCGAGGTGC TGCGCGGGCC GCAGGGCACG CTTTACGGCC GCAACGCCAC GGCCGGCGCG GTCAACATCA TCTCCGCCCG CCCCACCGAC CTCTTCGAGG GCGAGATCAA GGGCGAGGTC GGCGACTACG CCAGCCGTCG GCTGAGCGGC TTCGTGAACA TCCCGCTGAT CGGCGACAAG CTGGATCTGC GCCTGGCGGG AGCCTCGACC AACCGTGACG GTTTCGCCAA CAACACCACC ACGGGCAACA AGATCGACGG CCGCGACCTC TATTCCACCC GGGTGAGCCT GGGCTTCAAG CCAGCCGAAT CCGTGCGCGC CAATCTGGTC TGGGAGCATT TCAGCGAGAA CGACAACCGC GCCCGCAGCA CCAAGCAGCT CTGCACGCGC GACCCCGGCA TCCCGAGCCT GGGCGGCATG ACGCTCAGCC CGCAAACCAG CGCCTTCTTC AGCCAGGGCT GCCTGAACGC CTCGCTCTAT TCGGACGCCG CTTACGGCAC GCCCAACGGC CTGTCGATCC CGTTCGTCCT GGCCTCGGTG ACGAAAACCG CCGTCCTGGG CCTGAATCCC GACGGGTTTT CCGGCTCGGT GGATCTGCTG AAGGCCGTCG ATCCCTATGG CGGCGCGATG CAGTCTCGGG ACCTGCGGCA GGTGTCGTCG ATGTTCGACC CGAAGTACCG AGCCAAGGCC GACACCTTGG AGCTGAACGT CGATTTCGAC CTGTCCCCGA CCCTGACTCT GACCTCGCAG ACCGCCTATA ATCGCGATCA CTATTTCTCC AGCCAGGACT ACAACCGCTT CAACACCCAG CCTGGAGTCT TCAACGACTC CACGGGCCTG GACAACTTGG TCGTTGCCGG AACGCCCGCG CCGAACCTCA CTCCGGGCGG CGTCTATTGC GATCCCCAGC TGGGCTGTTC CAGCACCATC GCCGGCCTCG ATCAGTCCAA GGCCAAAAGC CGGCAATTCT CGCAGGAGCT GCGCCTGCAA TCGGCCTTCG ATGGCCCGCT CAATTTCAGC CTGGGCGCCA ACTACACCAA CTACCGAACG ATCGAGGACT ACTACGTCTT CTTCAACATC ATCTCGGCCA TCGCCCAGAG CAGCGGCTAT GGCAATAACA GCCTCGACCC TACCAAGTGC GGCAAGAATT TCGACGGTTC GACACCGACG ATCACCCCCG GCGGCACGAC GGGCTGCATC TATATCGACC CCAACCCGCT GGACCAGGTG AATGGCCTGG GCCACAACTA TTTCCGCAGC CAGAACGTGT ACAAGGTCAG CTCAAGCGCA GCGTTCGGCG AGCTGTACTA CAATATCACG CCGGACCTGA AGCTGACGGG TGGTCTGCGC TACACACGCG ATCGCAAGAC CTTCACCCCC GTGCCGACCC AGGTGCTGCT GTCGTCCGGG AACGGCGGAA CGGTCGATGG CGGCTATCCA GTCGGCGAGG ACATCAAGCA GCACTGGGGC GTGATCACCG GACGCCTGGG CCTGGACTGG TCGCCCAAGC TGTCCTTCAC CGACCAGACC CTGCTCTATG CGTTCTACAA TCGCGGCTAC AAGGGCGGCG GCGCCAACCC GCCCGCCATC GGCTACAGCG ACGAGCCGCT GGCCCCTGGT CTGCCGCCCC TGGTGCAACT GACGTCCTAT CCCGACACGT TCAAGCCCGA ATACGTCAAC GCCTTCGAGG TCGGGACCAA GAACACGCTC CTGGGCGGCG CCCTGACGCT CAACACCTCG GCCTTCTACT ATGACTACAA GGGCTACCAA GTTTCCCAGA TCCAGGATCG GACCGCGATC AACGAGAACT TCGACTCCAA GGTCTGGGGT CTGGAGTTCG AGTCGAACTG GCGCCCCACC CAGGCCCTGC GCCTGACCCT CAACCTGGGC TATCAGGACA GCAAGCTCGC CGACGGCTCC AAATCGATCG ACGTCGAGAA CCGCACCCAG GGCAACCCCG ATTTCACGGT GATGAAAGCC AACCCCTTCC TGCCGTCCAA CTGCGTGGTC TCGAAGGCCT ATGTGACCAG CTTGATCGCC GCCAACGTCG CCACCAACCA GCCCGATTAC GCCTATTTGG GCGGCGTCTG TCCGGGTTCG ATCGCGAGCC TGTTCCTGGT CCCCGCCCCG ACCGAGGCCG ACCTGCCCAA TGGCGGGCGC GGGTTCTATG CCGACCTGTC GGGCCACGGC CTGCCCAATC AGCCCAAGTG GACCCAGTCG CTCAGCGCCG AATACAGTCT GCCCCTGAGC GACGGCTGGG TCGGCGCGAT CCGCGGCGAC GTCTACCACC AGTCCCAGTC CTGGGCGCGG GTGTACGAGG ACGCCATCGA CAAGCTGCAC GGCTGGTACA ACGTCAACCT GCGGCTGACC GTGGCCAAGC CGGACTCCGG CCTGGAGTTC GAGCTCTACG CCAAGAACCT GCTGGATGAC TCGCCCATCA CCGGCGCCTT CATCAACAGC GACGACACCG CCCTCACCAC CAACGTCTTC ACGCTCGATC CGCGCGTGAT CGGAGCCAGC GTCCGCAAGA GATTCTGA
|
Protein sequence | MSQRTRARVY WVGGASAAAV ILSAGLANAQ AKPPTGNMVE EVVVTAQHKQ EAAQTVPIAI SAFSQKTLDA AKIEGGPDLL KAIPNVSFTK TNFSGVNLTI RGIGTQAVSV STDPGVSINF NGTAFIRNRF FEQEFFDLER VEVLRGPQGT LYGRNATAGA VNIISARPTD LFEGEIKGEV GDYASRRLSG FVNIPLIGDK LDLRLAGAST NRDGFANNTT TGNKIDGRDL YSTRVSLGFK PAESVRANLV WEHFSENDNR ARSTKQLCTR DPGIPSLGGM TLSPQTSAFF SQGCLNASLY SDAAYGTPNG LSIPFVLASV TKTAVLGLNP DGFSGSVDLL KAVDPYGGAM QSRDLRQVSS MFDPKYRAKA DTLELNVDFD LSPTLTLTSQ TAYNRDHYFS SQDYNRFNTQ PGVFNDSTGL DNLVVAGTPA PNLTPGGVYC DPQLGCSSTI AGLDQSKAKS RQFSQELRLQ SAFDGPLNFS LGANYTNYRT IEDYYVFFNI ISAIAQSSGY GNNSLDPTKC GKNFDGSTPT ITPGGTTGCI YIDPNPLDQV NGLGHNYFRS QNVYKVSSSA AFGELYYNIT PDLKLTGGLR YTRDRKTFTP VPTQVLLSSG NGGTVDGGYP VGEDIKQHWG VITGRLGLDW SPKLSFTDQT LLYAFYNRGY KGGGANPPAI GYSDEPLAPG LPPLVQLTSY PDTFKPEYVN AFEVGTKNTL LGGALTLNTS AFYYDYKGYQ VSQIQDRTAI NENFDSKVWG LEFESNWRPT QALRLTLNLG YQDSKLADGS KSIDVENRTQ GNPDFTVMKA NPFLPSNCVV SKAYVTSLIA ANVATNQPDY AYLGGVCPGS IASLFLVPAP TEADLPNGGR GFYADLSGHG LPNQPKWTQS LSAEYSLPLS DGWVGAIRGD VYHQSQSWAR VYEDAIDKLH GWYNVNLRLT VAKPDSGLEF ELYAKNLLDD SPITGAFINS DDTALTTNVF TLDPRVIGAS VRKRF
|
| |