Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3289 |
Symbol | |
ID | 5900744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3558401 |
End bp | 3561175 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641563795 |
Product | TonB-dependent receptor |
Protein accession | YP_001684914 |
Protein GI | 167647251 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAACC ATAAATCACG GCGCGGCCGG GCCACGGGTT TGCTGGGCGG AGCCTCGCTG GCCGCGCTCT GCGCGTTTGG GATCAACGCC AACGCTCAGG CTCAGGAAAC CGCCAAGCCG GAAGACAGCA CTGTTGAGGC CGTCGTGGTC ACGGGCTCTT TCCTGCGCAA CATCCGCCAG GAAGACATCG CCTCGCCCAC GGTCTCGATC GACCAGGCGC AGGTGGCGAA GACCGGCGTG CTTTCGGTGG GCGACCTGCT GCGCTACGTG CCGCAGAACG TCGGCAGCAT GGGCGGCGTG CAGGATCTGG CCAAGGGCGG CCAGGACAGC AAGGATACAC GCTCGGCGAA CTTGAGGGGG CTGGGCGCGG GGGCGACCCT GGTGCTGATG AACGGTCGCC GGGTCGTGAA GTCCGATGGC TACGTCAATC TCAACAGCCT GACGCCGGCG ATCGCCATCG CGCGGGTCGA GACGGTCCTG GACGGTGCGT CGGCCACCTA CGGCGCGGAC GCCGTGGCCG GTGTGTTCAA CATCATCACC GACGCCCGTT TCTCGGGCGC CAAGGCTAGT GCTCAGTACA CCTATGTTGA AGGCTCTCCG GCCTACCAGG TACAGGCGAT GGTCGGCGCG CAGGGCGAGC GCGCGCATCT GGTGGTCGCC GGCTCCTATA CCGACATCTC GCGTCTGCAA AATTCCGAGC GCGACGTCAC CAACATCTAC ATTCCGTCAT CGGGCGCCGG CGCCAATCCA GGCTCCTTCA CCCTGACGGC TAGGCCGCGC ACGTCGACGG GCGGCGACGT GGTGATCAAC AACGTCAACT ATTCGACGCT CTACGACACC TATAAGACCT CGGCCAACAC CCTGGCGGTC GTCGATCCCA ATTGCGGCTC GGCCGCGACC AAGAGTGTGT TCACGCCATC AGCCGCCGGA CCAGGCTACG CTATCGGTTC ATGCGCCTTC AGCTTCCAGG CCCAAAACCC CATCCGCGGC GCTAGCCAGA ACTATCTGCT GCACGCCGAG GGTGACTATC TGCTGACCGA CAATCACACG CTGTTCTTTG AAACTAGCAT CAATCACCAG GACTCTCAGC GCTACGGCGT GCCGTCCTAC TCACAGAACC ACAATGGTGC GACGCCGCCC GTGGTGCCGG CCTCCAATCC GTACAATCCC TTCGGCGTGG CGGTCTACTA TGTCGGGCGG CCGATCGGCT CTGAGGGCCT CGCTGGCACG ATGTACAATA TCCAGCGCAA CGAGGTGAAC CAGTTCCACA ATGTGCTGGG CGCCAAGGGT TCGCTGTTCG CTGACTGGCG CTACACCGCC ACCTTCACCT CGTCACGATC GACGAACATC TTCCGCGACC ATGATACGGA CATGAACCTC TTCCAGGCGG CGCTGAACGG CTATGGCGGC GCCAACTGCA ACTATCGCTT CAACGGTCCC GGCGTCGGCG CGGTCGCGGG GCAGGGTAAC TGCTACTATC TGAGCCCGTT CGCCAAGGAC AACGCCAGCC AGAACGCGGC GATCCTCCAC AACATCCAAA CCGAGGTCGT GACCCGCACC CAACGTGACT ACCTGATCGG TGACGTCGTC GTGGACGGCA CCCTGGGGGA GATCAAGCTC CCAGGCGGCC CGATCTCGGC GGCGTTTGGC GTGCAGTCGC GACGTGAAGG CCAGAAGATC ACCTATTCGG ATCTGCTCAA GAGCGGGTAC GCCGCCTTTG GCGGGCCGTC GGTCGATCTG GACGGCGAAC GCACCATCAA CAGCGTTTTT GGTGAGTTCA ACTTCCCGAT CATCGAGGGT CTGAACGTAG ACGTCGCCCT GCGGCACGAA GACTACGGCG GCTTTAAAAC CACCGACCCG AAGGTGGCGA TCAACTATCG GCCGGTGGAG AGCCTGTCGT TCCGCGCCTC GGCCAGCACC GCCTTCCAGG CCCCCAGCCT GGAAAGCACT TCCAGCGGCC CGATCAGCAA CAACGTCGTC AACATCACCG ACCCGGTGCA GGGTAACACC ACCTTCCGCA CAGTGACGAC GGTTGGCAAT CCCGACCTGC AGCCGCAGAC CGCCAAGGTG CATAACTTCG GGGCAACCTG GCTGCCGGTG TCGCGCGCCT CGCTGTCGCT GGACTGGTGG ACTTTCAAGT ACGACAACCA GATCGCCATC GAGAACGGCC AGGCGGTGAT TAACGCCAAC CCGACCGGCA GCCAGGTGAT CCGCGACCAA AGCGGCGCGG CCCAGACGGT GCTGGTGCGC AGCTACAACG CCAAGAGCGG CACCCAGACC TCGGGCCTCG ACCTGATGGC GACCTACGCC TTCGATTGGG GCGCCAGCAC CTTCACCCTG CGCGACAGCC TCAGCTATCT GCTGAAGTAC GACATCGACA CCGGCTCGGG CGTCTATGAC GGGGTGGGGC GGCGCAACAA CGCCATCACC TCGCCGCTCA CGGCCGCCGC CGCGCCGCGT TATCGCAACA CCGCCGGCGT CGACTGGTCG ATGGGGCGTC ACCAGGCCAG CGCCACGGTG CGTTACGTGT CCGGCGTGGA GGACGACTAC GCCATCGCCG TCACCGCCAA GGCCGCCACC AAGGTCAAGT CCTGGACGGT GCTGGATCTG CAATACGGCG TCGGCCTTGG CGAGGACGAG CGCTATCGCC TGACCGTCGG CATGATCAAC GCCTTCGATA AGGCCCCGCC GTCCGCCAAG TACACCGGCT ATCTGCAGGC CCTGGCCGAC CCGTTCGGTC GTCAATCCTA TGTCCGACTT GAGGCGCGGT TCTAA
|
Protein sequence | MGNHKSRRGR ATGLLGGASL AALCAFGINA NAQAQETAKP EDSTVEAVVV TGSFLRNIRQ EDIASPTVSI DQAQVAKTGV LSVGDLLRYV PQNVGSMGGV QDLAKGGQDS KDTRSANLRG LGAGATLVLM NGRRVVKSDG YVNLNSLTPA IAIARVETVL DGASATYGAD AVAGVFNIIT DARFSGAKAS AQYTYVEGSP AYQVQAMVGA QGERAHLVVA GSYTDISRLQ NSERDVTNIY IPSSGAGANP GSFTLTARPR TSTGGDVVIN NVNYSTLYDT YKTSANTLAV VDPNCGSAAT KSVFTPSAAG PGYAIGSCAF SFQAQNPIRG ASQNYLLHAE GDYLLTDNHT LFFETSINHQ DSQRYGVPSY SQNHNGATPP VVPASNPYNP FGVAVYYVGR PIGSEGLAGT MYNIQRNEVN QFHNVLGAKG SLFADWRYTA TFTSSRSTNI FRDHDTDMNL FQAALNGYGG ANCNYRFNGP GVGAVAGQGN CYYLSPFAKD NASQNAAILH NIQTEVVTRT QRDYLIGDVV VDGTLGEIKL PGGPISAAFG VQSRREGQKI TYSDLLKSGY AAFGGPSVDL DGERTINSVF GEFNFPIIEG LNVDVALRHE DYGGFKTTDP KVAINYRPVE SLSFRASAST AFQAPSLEST SSGPISNNVV NITDPVQGNT TFRTVTTVGN PDLQPQTAKV HNFGATWLPV SRASLSLDWW TFKYDNQIAI ENGQAVINAN PTGSQVIRDQ SGAAQTVLVR SYNAKSGTQT SGLDLMATYA FDWGASTFTL RDSLSYLLKY DIDTGSGVYD GVGRRNNAIT SPLTAAAAPR YRNTAGVDWS MGRHQASATV RYVSGVEDDY AIAVTAKAAT KVKSWTVLDL QYGVGLGEDE RYRLTVGMIN AFDKAPPSAK YTGYLQALAD PFGRQSYVRL EARF
|
| |