Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2407 |
Symbol | |
ID | 5899862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2624525 |
End bp | 2627479 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641562898 |
Product | TonB-dependent receptor |
Protein accession | YP_001684032 |
Protein GI | 167646369 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000117567 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.325532 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAG TCCAGCTTAT TTATCAGAGT AATGCGCCCA CGATCCAGCG CGCCTCGAAG GCTGCTCTCG TCGGTGCGTC CAGCTTTCTG GCGCTCGCGC TTTGCGGCGC GGCCAACGCC GCCGACCGGC CGGCCGAGGA GCCAATAACG ACGGCCGCGG CCGTCGCCGC GCCAACCGGC GAGCAGGCTT CCGAGCCGAA ATCCAGCGGC GGCACGGTCG TCGAAGAAGT GGTTGTCACA GGCCTACGCG GTTCGCTGCA ACGCAACCTC GACATCAAGC GGACGTCACC GGGCGTCGTC GACGCCATTT CGGCCGAGGA CATCGGCAAA TTCCCCGATT CCAACGTCGC CGCATCATTG CAACGCCTGC CGGGCGTCTC CATTCAGCGC GCGGGCGCGC GTGGCGAACC GCAGGGTATC ACCGTTCGCG GCTTTGGCGG CGACTTCAAC GAGACCCTCT ACGACGGTCG TCGGATCTCC ACGGCCACGG GCGGCCGCTC GGTGGACTTC AGCACCGTGG GCGCCGACTT CGTCGGCGGC CTGTCGGTGC TCAAGACACC CGACGTCACA CTCTCGAGTA GTTCGATCGG CGCGACCGTC AACGTCGCGT TTCCAAAGCC GTTCGATCAT CCCGGCCGGC GCATGGCGTT CACCGCCTCG GGCTCGCTAC AGGACGACGC GGGCAAGGTG GCGCCCACCG TCGGCGCCCT GTTCAGCGAC ACCTTCGCCG ATAACCGGTT CGGGATCCTG GTCGATGCGA TGTACACGCG CCACGACACC CAGACCAACC GGGTCTATGT CAGCGGCTGG CCGGGCGGAC GCTACGCGCC GTGCCAATTG ACGCCGACCT GCACGCCAAC CCGGTTGGCC GACAAGTCCA TCGTCGGATG GTTCGAGCAA CAGTATGGCG CAAGCCAGAT CTATACCAAG GACGAGCGTG TCGATGGCCG CATCGCCCTG CAATGGAGTC CGTCCGAAGA CCTGACGGTC ACCCTGGACG ACAACTACTC GCGTCAGAAC ATCCGCGCCG ACAATTTCGG CTATGGCATC TGGTTTAATC AGGACGGCCT GAGAAACGTC AAGCTGGACA AGAACGGGAC GACCGTCGAT TTCACCCAAG CCGGATCTCA GACCGATTTC GTGGCTGGAA CCGATCGCTC GATCCTCCAG ACCAACCAGA CGGGTCTGAA CCTCAAGTGG GACGTGTCCC AGAACCTGAA CTTCGAGGCG GACGCCAGCT ACGCCAAGAG CTGGCTCAAC CCCGGCGGCG AGATCAGCAG CGACAACGCC GACGTGGGCT ACGGCTTCGC CATCGGCCCT GCTCTGGGCA TCAGCATCGG CGGCGACAGC AAGAACACCC TGCCGGTGTT GCATGGCTAC GGCCCAAACG GCGACGCCGC GCGCTGGGCC GACACTTCGG TCCTGGGCTC TCACGTCACC GTGCGCCAAG CTCAGGAAAA CACCGACGTC GTCAAGCAAC TGCGGTTCGC CGGCTCGTGG GAACAGGAAG GCTTCCGCAT CAAGGCCGGC GGCAGCTATC TGGAAGACCA CTACCAGTTC CAGCAGAGCA ACACCTTCGT CAACAACTAC TGGCAAGCCT ATCCGGGCTA CGGCCCGCCA TCGGGTCCCA ACGGCGGTGT CCTGGCGCCG TCCAGCCTGT TCACCGACAA GGTCAGCACC AACCACTTCA TCCCCGGCTT CTCCGGCGCC CTGCCGCCGA CGCTGTTGAA GTTCGACGCC CACGCCTATC AGCAGTTCCT CACGGCTCTT GGAAATCCCC AAACCCAGAC TATTCCGGGC TTCAACTATA GCGGCGGCAA CGTCGGGACC ACGTTCACCG GCGCGTTCAA TCTGGGGCTC GACAACGGCA GTATTCGCGA CATCACCGAG AAGACCTGGG CGCTGTTCCT GCGGGCCAAC TTCGACGTCG ACGTGGCCGG CATGCCGTTC CACTTCAACG CCGGCGTACG CGAGGAGAAC ACTCACGTCA CATCCAACGG CTTTGGTCAG GTGCCCACCG CGATCACCGG CAGCGCCGGC GATCCGACGC TGCTGACCGT GACCTTGAGC GCGCCTCAGG CCGTATCGAC CAAGAGCAAC TATTCCTACC TGCTCCCAAG CATCGACCTG AAGCTGGAAC TGACCGAGAG CATCCATCTG CGCCTCGATG CGTCTCGAAC GTTGACCCGT CCGAGTCTGA ACCTGCTCAC GCCGGTGGCC AGCGTCGGCA CCGGCCAGCG GGTCGGCGCC CTCACGGCCA GCGGTGGCAG CCCCTCGCTC AAGCCTTATC TGGCCGACAA TTTCGATGCG GCGGTCGAAT GGTATTACCG GCCCAACTCG TATGCGTCGG TCAATTTCTT CATCAAGGAC GTCAGCAACT TCATCATCGG CGGCACCCAG CGACAGACGA TCAACGGCGT CATCGATCCC ACGACCGGCC AGCCGGCGAT CTTCAGCGTC ACGCAGCAGG TCAACGGTCC GGAAGCGACC GTGCGTGGGG TTGAATTGGC CTGGCAACAC GTGTTCGGCG ACAGCGGTTT CGGCTTCAAC GCCAACGCCA CCCTGGTCGA CACGAACAAG CCCTACGATC GCACCGACAT CTCACAAAGC GGCTTTGCGA TCACCGGCCT GGCCGACTCC GCCAACCTCG TGGCCTTCTA CGACAAGAAC GGTCTCGAAG CCCGGGTCGC GGTCAACTAT CGCAAGGAGT ACCTGCGAGG CTTTGGTCAG AACCAGAACA CCGGCGCCTT CGGTTCTGAA CCGACGTTCG AAAATCCGAA CCTGCAGATC GACTTCAGCA CCAGCTACGC CCTGACCAAG CAGATCAACC TGTTCTTCGA AGCCCAGAAC CTCACCAACG AGACGCAGAG CACGCACGGA CGGTTCGACA ACCAACTGCT CGACGTATTC GCCTATGGCC GGCGCTACAC CGCCGGCGCA CGTTTCCGCT TCTAG
|
Protein sequence | MKTVQLIYQS NAPTIQRASK AALVGASSFL ALALCGAANA ADRPAEEPIT TAAAVAAPTG EQASEPKSSG GTVVEEVVVT GLRGSLQRNL DIKRTSPGVV DAISAEDIGK FPDSNVAASL QRLPGVSIQR AGARGEPQGI TVRGFGGDFN ETLYDGRRIS TATGGRSVDF STVGADFVGG LSVLKTPDVT LSSSSIGATV NVAFPKPFDH PGRRMAFTAS GSLQDDAGKV APTVGALFSD TFADNRFGIL VDAMYTRHDT QTNRVYVSGW PGGRYAPCQL TPTCTPTRLA DKSIVGWFEQ QYGASQIYTK DERVDGRIAL QWSPSEDLTV TLDDNYSRQN IRADNFGYGI WFNQDGLRNV KLDKNGTTVD FTQAGSQTDF VAGTDRSILQ TNQTGLNLKW DVSQNLNFEA DASYAKSWLN PGGEISSDNA DVGYGFAIGP ALGISIGGDS KNTLPVLHGY GPNGDAARWA DTSVLGSHVT VRQAQENTDV VKQLRFAGSW EQEGFRIKAG GSYLEDHYQF QQSNTFVNNY WQAYPGYGPP SGPNGGVLAP SSLFTDKVST NHFIPGFSGA LPPTLLKFDA HAYQQFLTAL GNPQTQTIPG FNYSGGNVGT TFTGAFNLGL DNGSIRDITE KTWALFLRAN FDVDVAGMPF HFNAGVREEN THVTSNGFGQ VPTAITGSAG DPTLLTVTLS APQAVSTKSN YSYLLPSIDL KLELTESIHL RLDASRTLTR PSLNLLTPVA SVGTGQRVGA LTASGGSPSL KPYLADNFDA AVEWYYRPNS YASVNFFIKD VSNFIIGGTQ RQTINGVIDP TTGQPAIFSV TQQVNGPEAT VRGVELAWQH VFGDSGFGFN ANATLVDTNK PYDRTDISQS GFAITGLADS ANLVAFYDKN GLEARVAVNY RKEYLRGFGQ NQNTGAFGSE PTFENPNLQI DFSTSYALTK QINLFFEAQN LTNETQSTHG RFDNQLLDVF AYGRRYTAGA RFRF
|
| |