Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0321 |
Symbol | |
ID | 5897595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 363036 |
End bp | 365840 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641560805 |
Product | TonB-dependent receptor |
Protein accession | YP_001681956 |
Protein GI | 167644293 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0815676 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACGTC GTCCATCACA CCGGGGGGCC ATCATGGCCG GCGCGGGCAT TGTCGTCCTG CTCACCGCGA GCGCGGCGAT CGCGCAGGTT TCCGCCAGCG CCCCTGATGA CGAGACGGTC GAGGAAGTCG TCGTCGTTGG CGTGCGCGCC AGCGTGGCCA AGGCCGTCAA GCTCAAGCGC GACGCCACCA CGGTTCAAGA CTCGATCAGC GCCCTGGAAC TGGGCATGTT CCCGGACGAC AACGTCGCCG ACTCCCTCAG CCACATCACC GGCGTGTCGA TTTCGCGCAC CGCCGGCGGC GAGGGCCAGA AGGTCAGCGT CCGCGGCCTT GGTCCCGAAT ATACGCTGTC GACCTTCAAC GGACGCATCC TGGCCACCGA TGGCGCGGGC CGTGACTTCG CCTATGATGT GCTGCCAGCC GATGTCATCA GCGGCGCCGA CGTGATCAAG GGCGCCGAGG CGGCCAACAC CGAAGGCGCC ATCGGCGGCC TGATCAACCT GCGCTCGGCG AGCCCCTTCG ACAGGCGCGG CCAGCAGGGC ATCGTGCGCC TCGAGGGCGA CCGCAACCAG ATGTCGGAAC TGGACGGTCG CAAGCTGTCG GCCGTCTACA GCAACACCTT CGCCGGCGAC ACGCTGGGCC TGCTGCTCGG CGTGGTCCTC GAAAAGCGCG ACGACCGCAC CGACGTGGCC GGCAATGACG GCGGCTGGAC GCGCAACGCG GATCCGACCG ACGTAAGCTG GCTGTATGGC AACGCCTGGG GCGGCTCCAT CGACCCGAAC AACAACGGCG TGCTGGACCC CGAGGAATAC GGCCTGATCG GTCCGGGTCA GTTCCGCGTC GGTTCAATCC TGGAAAAGAA GAAGCGCCAC GCCTTCACCG GCAAGCTTGA GTGGCGGCCG TCCGATAACT TCAAGCTGGT CGTCGATGGA CTGAGCACCA AGCTCGATTC CCCCCAGGTC GGCTACCAGC AGTCCTTCTA CCCGCTCTAC GCGCCGGGCC GCTGGTCGAA CATGGTGGTC AAGAACGGCA TCGTCACCAG CTTCGACATG AACAATCCCG ATCCCGAGAT GCGTCTCAAT CCGGAGCTGC TCAACAAGAC CGAGTTCCGC GTCGTCGAGA CGCAGCTCTA TGGCGCCAAC GCCGAATGGA AGGTGTCGGA CACCCTGACC ATCACGGGCG ACGTCTACCG CTCGACCTCC AAGCGCCACT CGGGCGGGCA GGACAGCTAT GTCGTCCTGC GCATGAACCA GCCCAACAGC GCCCATATCG AGCTGACCGG GGAACGCGTG CCCAACGTCA CGGTCAATTT CGACGACGGC CGCGACCTGG CCAGCGGCCT GGAGAAGGGC CTGTTCCACG ACTCCGACTT CAACACCCAC TACTTCTCGC TCGCCGGCGA CAATATCGAC GACAAGATCA CCGGCGCCTC GTTCAAGGGC GCCTGGGCGA CAGGGCGCGG CTGGCTCGAC AACGTGCAGT TCGGGGTCAA CTACACCGAC CGCAAGAAGT CGCGCGACCT GGTCAACAAT GCGCTGACCG GCGGGGCCGA CTACTATTCC GGCGACTACG CGATCAACGT GGGCGCGCTC GGCGGCAACG TCATTTCCGA CAGCTTCTCC CTGCCCCACT TCATGAGCGA GGTGGACTCC AAGTTCCCGC GCACGTTCCT GTCCTTCGAC ATCCCCAAGT ATCAAGCGGC GCTCGCGGCC TATAACGGCA AGCCACGCCC GGGCGGCGGC ACGTACGACT ATTCCAAGGC CGCGCCGGCC TGGAATCCCC TGCAAAGCTA CCGCGTGGGC GAAGAGACCT GGGCCGGCTT CGTGCAGGCC AATCTGGAGG GCAAGCGCTG GAGCGGCAAT GTCGGCCTTC GCGTCGTGCG CACCAAGACC AACGCCCAGG CCTGGGACGC CAAGATCCTG CAAGTCATCG AAAACGGCGC GTTCAACTAC ACGGCTGTCT ACGCCGCGCC GACCTCCATC GAGCAGAGCA ACACCTACAC CTACGCGCTG CCGTCGCTGA ACCTCAACTA CCGCTTCACC GAAGAGCTGC GCCTGCGCTT TGGCGCGGCC AAGACCATGG CCCGTCCGTC GGTCGCGACC CTGGCGCCGA CCAACACCAC CGAAAGCGTC TCGTGGGGCG AGTTCACCCA GATCTACAGC GGCAACGCCG AGCTCAAGCC CTATCAGGCC AAGCAGTTCG ACCTGTCGCT GGAATACTAT TTCCGCCCCA ACTCGGTGTT CAACGTGGCG GTGTTCCACA AGCACATCAC CGACCAGATC ACCACCAGCT GGGAACCGGG CCAGGATATC GGCGTACCCG GCCACCTGTT CAACATCAGC CGTCCGATCA ACGGCGACTA CGCCAAGGTG AAGGGCGTCG AGGTCGGCCT GCAGCACTTC CTGGACAACG GCCTGGGCGT GCGCGCGCAA TATACGCGTA ACTGGGCCAA GAGCTGGGTT GGTGACCAGG AGCGTCCGCT CGAAGGCATC GCGCCCTCGG TCTATTCGCT GGGCGTCTTC TACGACCACG GTCCGGTGTC GCTCAGCCTG TCGGGCGATC ACACCGCGGG CTTCACGACG GCCGTCAACG TGCTCGGCGC CGGTTACAAC GAGAAGGCCG ACGCGATCAC CTGGGTCACG GCCCACGCGT CGTACAAGAT CAACGACAAG ATGGACATCT CGCTCGAAGG CCAGAACCTT CTGGACGAGG CCAACACCTA CAGCATCAAC GGCAACTCGA TGCTGCCGCA GGGCTACTAT CGCTACGGGG CCAGCTACAA GCTGGGCCTG AGCTACCGCT TCTAG
|
Protein sequence | MSRRPSHRGA IMAGAGIVVL LTASAAIAQV SASAPDDETV EEVVVVGVRA SVAKAVKLKR DATTVQDSIS ALELGMFPDD NVADSLSHIT GVSISRTAGG EGQKVSVRGL GPEYTLSTFN GRILATDGAG RDFAYDVLPA DVISGADVIK GAEAANTEGA IGGLINLRSA SPFDRRGQQG IVRLEGDRNQ MSELDGRKLS AVYSNTFAGD TLGLLLGVVL EKRDDRTDVA GNDGGWTRNA DPTDVSWLYG NAWGGSIDPN NNGVLDPEEY GLIGPGQFRV GSILEKKKRH AFTGKLEWRP SDNFKLVVDG LSTKLDSPQV GYQQSFYPLY APGRWSNMVV KNGIVTSFDM NNPDPEMRLN PELLNKTEFR VVETQLYGAN AEWKVSDTLT ITGDVYRSTS KRHSGGQDSY VVLRMNQPNS AHIELTGERV PNVTVNFDDG RDLASGLEKG LFHDSDFNTH YFSLAGDNID DKITGASFKG AWATGRGWLD NVQFGVNYTD RKKSRDLVNN ALTGGADYYS GDYAINVGAL GGNVISDSFS LPHFMSEVDS KFPRTFLSFD IPKYQAALAA YNGKPRPGGG TYDYSKAAPA WNPLQSYRVG EETWAGFVQA NLEGKRWSGN VGLRVVRTKT NAQAWDAKIL QVIENGAFNY TAVYAAPTSI EQSNTYTYAL PSLNLNYRFT EELRLRFGAA KTMARPSVAT LAPTNTTESV SWGEFTQIYS GNAELKPYQA KQFDLSLEYY FRPNSVFNVA VFHKHITDQI TTSWEPGQDI GVPGHLFNIS RPINGDYAKV KGVEVGLQHF LDNGLGVRAQ YTRNWAKSWV GDQERPLEGI APSVYSLGVF YDHGPVSLSL SGDHTAGFTT AVNVLGAGYN EKADAITWVT AHASYKINDK MDISLEGQNL LDEANTYSIN GNSMLPQGYY RYGASYKLGL SYRF
|
| |