Gene Information |
Locus tag | Caul_3280 |
Symbol | |
ID | 5900735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3547012 |
End bp | 3549783 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641563786 |
Product | TonB-dependent receptor |
Protein accession | YP_001684905 |
Protein GI | 167647242 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
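
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it reproduces the reported 2772 bp) and that the CDS ends with exactly one stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(3547012, 3549783)
print(length)                  # 2772
print(protein_length(length))  # 923
```

Both values agree with the Gene Length and Protein Length fields listed for Caul_3280.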
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.322049 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGGA CGCGTGACAG CCACAACAGG TCCCAATTGA TGCTCGCGGC CGGCGCGACG GCCCTGATGG TCGCGGCCGG AGCGCAAGCC CAGGTGCGCA AATTCGACGT TCCCGCCCAA CCAGCGGTCA CCGGCATACC TCAGTTTGGC CAGCAGGCCG AACTGCAGAT TCTGGCGCCC CAGACCGCGG TCCAGGACAA GCGCGTCAAC GCCGTGCGCG GCGCCTACAC GGCTGACGAG GGGCTGACGC GGCTGCTGCG CGGCGTGGGC CTTACCGTCG TCTCCAACAA CGGCCGCACG GTGGTGCTCA ACCAATCGGC GCAGGGAGCG GACCCGGCCG CCGACCCGGC GCCCCCGCAG GACCAGACGA TCGTCCAGGA AGTCGTCGTC ACGGGCATGA CCTCGCGTAA TCGACCGCTG ATCACGGCCT CGGCGGACAT CACCCTGGCG AGCCGCGCCG ACATCGAGCG CAAGGCGCCA CGTTCGACCG CCGACCTGCT GGAGCTCGTG CCCGGCATTT TCGTCGAAGG CACCGCCGGC GAACTGTCCA ACAACTATTC GGTGCGCGGT CTGCAGGGCG GCGGCCAGCG CTTCATCCAG CTGCAGGAGG ATGGCCTGCC GATCATCTAT CAGGGCGGCG GCGCGGACTT CTTCTTCTCC GAGGACGCCA CGATCGACCG GATGGAAGCG GTGAAGGGCG GCACCTCGGG CATCCTGACG GTCAACGGCG CGGGCGCGAC GGTCAACTTC ATTTCGCGCA AGCCCAACTT CGAAAAGCCC GAAGGAATGG TGCGGGCCTC GGGGTACGAC TATGGCCTCA AGCGCGGCGA TTTCTACTAC TCGGCCCCGA TCGCCAACAA CCTGGCGTTC AACGTCGGCG GCTATGTCCA GAGCAGCCCC GGCGTGCGCA AGAACACCTT CGACTACGAC GGTTACCGCC TGAAGGCGAT GCTGGAATAT CGCTTCGACG AGGGCGGCTA TCTCCGCGTG ACCGGCAAGG CCGGAGACAT GAAATCGGCC TACTACGCCG ACCAGCCCTA CGCCTACAGC AACGGCAAGC CGCGAGGCGT TCCGGGTCTG GACACCCAGT TCGGCAACAT CGGCGGCGAC GCCTTCAACC GCATTTCGGT TCCGGTATCG ACCTTCGTCG AAAGCGACGG CTTCCGTGAC TTCCGGCTCA GCGAGGGCGT GCGGGTCAAG ACCAAGCAAC TGCGGATCGA TTTCGAGAGG CCGCTCAACG ACAGCGTCGA GATCTTCGCC CGGGCCCGCT ATCTGGACCT GAAGGACGAC TTCAACGGCA TCTTCCCCGG TTCAGGCACC GGCAATGCGG GCCTGACCAG TGCGGTGAAC TACCTGACGC CCGGCGCCAG CTCGCCGATC AACAATCTGC TCACCGCAGG TCAGGCCGCC TATCCGGCCA CCGTGCGCTT CGGGGCCAAG AACCTGCGAA CCGGCGTGGT CATCGCATCG AATGACACCG CGACCCTGAA CGCCCTGAAC GGCAACGGCT TCCTGCAGGA GACGACGCTC AATCACGACT ATCAGTCGGG TCACGATTTC GGCGCCAACA TCGGGACGCG TTGGGAATAC CAGGCCGACG GGTTCCAGAA CTCGCTGACC GCGGGGGTTC AGTATTACGA CGTTAGCCGC AGCCAGAACC AGTCGGCGGT GGCGACGGTG GTCAACGACG TTCGCACCAA CAGCGACCTC TACGACATCG TTTCGCTCGA CGCGAACAAC CAGGTCATCG GCGTGCTGAG CGACAATGGG CTGGTCTCGT ACGGTGACTG GGGCGCGGGA 
ATGCGGCGGC GCACCGACAA GTCGGTGTCG CTCTACGCCA ACGACGAGCT GGCGATCGGC GACAAGATCC ACATCGACGG CGGCGTGCGC TGGGAAAGCG ACAAGGCCAG GTATCTCGAG GGCAACACCG CGGCCGTCAA CCAGCCGGTC CAGCCAGGGG TGGTTGGCGT GGTGCGCACC GTCGGGTCGA CCTTCGACGG AACCTATACC GAGCGGCGCA AGACGCAGGA CAAGATCGCC TGGAGCATCG GCGCCAGCTA CCTCTTCACG CCCCACTTCT CGCTCTATGG CCGCTACGCC AACGGCTTCC AGACCAACAA CACCGATCCG ATCACCAAGA TCGAACTGTA CGAAGCGGGC CTGCGTTTCG AATACGGCCG GGTCTTCAGC GGCTCGGCGA CGGTGTTCCG AACCAATTTC GACAACCAGT TCTACAACTT CATCGACCCC TCCGACCCCA CCCGGCAGAC CAGTTATCTG GCCGACCTAC GGACCAAGGG GCTGGAGATC GACGCGCTCG TGCGGCCCGT CGACTGGTTC TCGGTCAACG TCTCGGGCGT GCTCCAGGAC CCGACCCTCA ACAATCTCAG CCTGAACGGC GTGGCGCAGC CGACCTATGA CGGCAATCGC CCCGAGCGGA CGCCGGCGCG GCTATACACG ATCACCCCGA CGATCAAGCT GCCCAACGAC CGGGGCGAGA TCTATGCCCG CTACAAGTAC GTCGGCAAGA TCTACGCGGA CGCTGGCAAT GGCGTGGCGC TGCCGTCCTA TGGCGTCACC AGCGCAGGCG TCACCCTGAA CCTGAAGGAC AACCTCCAGG TCAATCTCAA CGTTGACAAC ATCTTCGACG TCATCGGCCT GACCGAAGGC AATCCGCGCC AGGGCCAGAC CCAGAATGCC TCCTCCGGCT ATTTCTACGC CCGTGGCATC GTGGGCCGGA CCTATGGCGG ATCGCTGACG CTCCGCTTCT AA
|
Protein sequence | MRRTRDSHNR SQLMLAAGAT ALMVAAGAQA QVRKFDVPAQ PAVTGIPQFG QQAELQILAP QTAVQDKRVN AVRGAYTADE GLTRLLRGVG LTVVSNNGRT VVLNQSAQGA DPAADPAPPQ DQTIVQEVVV TGMTSRNRPL ITASADITLA SRADIERKAP RSTADLLELV PGIFVEGTAG ELSNNYSVRG LQGGGQRFIQ LQEDGLPIIY QGGGADFFFS EDATIDRMEA VKGGTSGILT VNGAGATVNF ISRKPNFEKP EGMVRASGYD YGLKRGDFYY SAPIANNLAF NVGGYVQSSP GVRKNTFDYD GYRLKAMLEY RFDEGGYLRV TGKAGDMKSA YYADQPYAYS NGKPRGVPGL DTQFGNIGGD AFNRISVPVS TFVESDGFRD FRLSEGVRVK TKQLRIDFER PLNDSVEIFA RARYLDLKDD FNGIFPGSGT GNAGLTSAVN YLTPGASSPI NNLLTAGQAA YPATVRFGAK NLRTGVVIAS NDTATLNALN GNGFLQETTL NHDYQSGHDF GANIGTRWEY QADGFQNSLT AGVQYYDVSR SQNQSAVATV VNDVRTNSDL YDIVSLDANN QVIGVLSDNG LVSYGDWGAG MRRRTDKSVS LYANDELAIG DKIHIDGGVR WESDKARYLE GNTAAVNQPV QPGVVGVVRT VGSTFDGTYT ERRKTQDKIA WSIGASYLFT PHFSLYGRYA NGFQTNNTDP ITKIELYEAG LRFEYGRVFS GSATVFRTNF DNQFYNFIDP SDPTRQTSYL ADLRTKGLEI DALVRPVDWF SVNVSGVLQD PTLNNLSLNG VAQPTYDGNR PERTPARLYT ITPTIKLPND RGEIYARYKY VGKIYADAGN GVALPSYGVT SAGVTLNLKD NLQVNLNVDN IFDVIGLTEG NPRQGQTQNA SSGYFYARGI VGRTYGGSLT LRF
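
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be rejoined before any downstream use. The sketch below shows one way to do that and to run a basic sanity check on a coding sequence. The helper names are hypothetical, and the accepted start codons (ATG, GTG, TTG) are an assumption covering common bacterial starts rather than the complete set allowed by translation table 11.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def looks_like_complete_cds(
    seq: str,
    starts=("ATG", "GTG", "TTG"),  # assumed common bacterial start codons
    stops=("TAA", "TAG", "TGA"),   # standard stop codons
) -> bool:
    """Check length is a multiple of 3 and the start/stop codons are plausible."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# Toy input in the same 10-character block layout (not the full gene).
toy = clean_sequence("ATGCGAAGGA CGCGCTTCTA A")
print(toy, looks_like_complete_cds(toy))
```

Applied to the full gene sequence, the same check should confirm a length of 2772 bases beginning with ATG and ending with TAA, as seen at the start and end of the record.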