Gene Information |
Locus tag | Caul_2383 |
Symbol | |
ID | 5899838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2590565 |
End bp | 2593345 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562874 |
Product | TonB-dependent receptor |
Protein accession | YP_001684008 |
Protein GI | 167646345 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
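
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2590565
end_bp = 2593345
gene_length_bp = 2781
protein_length_aa = 926


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions, both comparisons should agree with the reported Gene Length and Protein Length.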
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTTACA AGGACATCAG CTACGCGGTC CTGCTGGCCG CGACGACCGC CGCCGGCGTC CACGCGCCGG CCTTCGCCCA GGCGCCGACG GTGACGTTCG ACATTCCGGC GGGCGACCTG ACCGCGTCGC TGAACACCTT CGCGCGCCAG GCCGGCGTGC AGATCTTCTT CCCGTCGGCC GAGCTGGCGG GCCGCAAGGC GCCGGCCATC AAGGGCGCGA TGCCGGCCCA GGCCGCGCTG TCGAGGCTGC TGGCCGCCGG CGAGCTGGAG ATCGCCGCCG ACGATGGCCA CACCATCTCG CTGCGCCCGG CCCGGCGGGA CGCCGCCGTG GCCCTGGACG AGATCCTGGT CACGGCCCAG AAGCGCGAGC AGAAGACCAT CGACGTGCCG TTCGCCCTGA CCGCCTACAG CGGCAAGGCG CTGGAGCGGC TGGGCGTCAC CAACTTCCGC GAACTGTCGA CCCACGTGCC AGGCCTGATG GTCGAGGACC AGTCGCCCAA CAATCCGATC TTCGTGATGC GCGGCATCAC CTCGTCGGGC GGGGACTCGT TCACCGAGCC GCGCGTGTCG GTGTTCCAGG ACGGCGTGTC GATCTCCAAG TCGCGCGGCT CCTACATCGA GCTCTACGAC AACGCCCGGG TCGAGGTGGC CAAGGGGCCG CAGTCGACGC TGTTCGGGCG CGGCGCCTTG ATCGGCGCGC TGAACGTGAT CCAGAACAAG GCCGGTCCGG CCCCCGATTG GTCGGTGGCG GCCGAGGGCG GCAACCTGGG CTACCACCAG CTGGACGCCA TGCTGAACCT GCCGGTGTCC GACACGGTGT CCCTGCGCGT GGCGGGGCGT CGCAAGAGCC GTGACGGCTA TGAGAAGAAC CTCGACCCCG CCGCCGACGG CGACCTGAAC GCCATCGACA CCAACGCCTA CCGGGTCGCG CTGAGCTTCA AGCCGAACGA CCGCTTCAGC GCCGACCTGA TCTACAACCA CCAGGACGAC GAGACGAACG GCACCGGCTT CAAGTCGATG TATGTCAGCC CGACCGACCC GGCGACGGGC AAGGTGCTGG CCGGCACGCG GGTGGACGAT CCGGTCTGGC TGTCCAAGCC CGCCGACTTC GCGCTCGGCC ACCATCTGGG CGTCGACCAG TACCAGAACG GCGTCATGGC GCTGGTGAAG TACAAGGTCT CCGACGCCCT GACCCTGAAC GCGACGACCA GCTATCGCGA CATCGACGCG GTCGAGGTCT ACGACGCCGA CGGCACGTCG CTGCCGCTGT TCACCAACAT GGAGGACGTC GGCGGCACGC AGGTCAGCCA GGAGCTGCGC CTGAACTACG ACAAGGGTGG TCGCTTCTCT TGGTTCGCCG GCGCCAACTA TTACCGCGAG CGCTCCAAGG CCCGGGTCGA TGTCCGCTTC GACGAGCGGA TGCTGCTGGC CCAGGCGGCG GGCATGCTGA GCGGCGGACC GTTCACCGGC CTGCCCAAGA CCACGCCAGC GCCGGCCTCG CTGTTCGAAA GCACGGCCTT CACCGGCGCG CTGCTGCAGG GTCTGGTCAC CCAGTCCAGC AAGGGCAATC TGGTGCTGAC CAGCGCCGAG GCCGGCGCTC TCGCTGCGCG GCTGGATCCA CACCACGTCG AGACCTCGCG CAACGAGTCC GACCTCGACG CCTATGATCT GTTCGGCGAC ATGACCTTCC ACCTCACCGA CCGCTTCGAG CTGTCGGGCG GCCTGCGCTA CAGCCGCGAC GAGAAGACCA CGATCTGGGG CAGCTCGGTG CAGGGCCGCA GCATCCTGGG CGGCGCCATC 
GGCGCGGCCG GGATCGCCGC CACCGGCGCG CCGGCCGGCG TCGCCACGGC CCGCGCCCTG ATCCAGGGCA TGACCTTCTA TGGCCCGACC CTGAACGGTC CGGTCCCGCT GTTCGGGGTG TCGGCTCAGC CCACGGCCCA TAATGGCGAC TTCGCCAGCC GCGACCTGAC CGATGATGGC GTCACCTGGC GCCTGACCGG CCGCTATGCG CTCAGCCCTA CCGCCAACCT CTATGCCAGC TATTCGCGTG GTCGCCGGCC GGGCGTACTG TCGGCCGGGG CGCCGGGCGC GCCAGACGGG ACACCCACCT TCGCCATCGC GCCGGCCGAG ACCGCGCAAG CCTACGAGAC CGGGATCAAG GCCGACCTGC TGGACCGCCG CCTGCGGATC GACGGCTCGC TCTACTACTA TGACTACGAC AACTTCCAGA CCCGCGAGCA GCGTGGCTCG ACCTTCGTGA CCACCAACGC GGGCACGGCG CGGGCCTACG GGTTCGAGAG TCAGGCCGAT TTCGCCGCGA CGCCGAACCT CGACCTGTTC GGCACCTATG CCTACAACCA CGCCCGCTTC ACACGCGGCG CCTATGAGGG CAACCACTTC GCCCGCTCGC CGGACCACAT GGTGTCGCTG GGCGCCTCAA TGCGCTGGAC GGGCCTGGGG GGCAGGTTCG ACTTTCGGCC GACCTACACC TGGCGTTCGA AGATCTTCTT CGCCGACGAC AACGACCGGC CCGAGCTGCA GGCGGGCCTG CTGGTCCCCG ACGCCGGCCA GGACGAGTTC CAGAACGGCT TTGGCCTGCT CAACGCCCGG ATCAGCTACG CGCCGGAACG GGGTAGCTGG GAAGTGGAGG CCTTCGGCAG CAACCTGACC GACGAGATCT ATCGCAAGGG CGCGGGGAGC GCCGGCAAGT CGATCGGCTT GCCGACCAAT GTGCTGGGCG AGCCGCGCGT CTACGGCCTG CGCCTGACCA TCCACCGCTA G
|
Protein sequence | MTYKDISYAV LLAATTAAGV HAPAFAQAPT VTFDIPAGDL TASLNTFARQ AGVQIFFPSA ELAGRKAPAI KGAMPAQAAL SRLLAAGELE IAADDGHTIS LRPARRDAAV ALDEILVTAQ KREQKTIDVP FALTAYSGKA LERLGVTNFR ELSTHVPGLM VEDQSPNNPI FVMRGITSSG GDSFTEPRVS VFQDGVSISK SRGSYIELYD NARVEVAKGP QSTLFGRGAL IGALNVIQNK AGPAPDWSVA AEGGNLGYHQ LDAMLNLPVS DTVSLRVAGR RKSRDGYEKN LDPAADGDLN AIDTNAYRVA LSFKPNDRFS ADLIYNHQDD ETNGTGFKSM YVSPTDPATG KVLAGTRVDD PVWLSKPADF ALGHHLGVDQ YQNGVMALVK YKVSDALTLN ATTSYRDIDA VEVYDADGTS LPLFTNMEDV GGTQVSQELR LNYDKGGRFS WFAGANYYRE RSKARVDVRF DERMLLAQAA GMLSGGPFTG LPKTTPAPAS LFESTAFTGA LLQGLVTQSS KGNLVLTSAE AGALAARLDP HHVETSRNES DLDAYDLFGD MTFHLTDRFE LSGGLRYSRD EKTTIWGSSV QGRSILGGAI GAAGIAATGA PAGVATARAL IQGMTFYGPT LNGPVPLFGV SAQPTAHNGD FASRDLTDDG VTWRLTGRYA LSPTANLYAS YSRGRRPGVL SAGAPGAPDG TPTFAIAPAE TAQAYETGIK ADLLDRRLRI DGSLYYYDYD NFQTREQRGS TFVTTNAGTA RAYGFESQAD FAATPNLDLF GTYAYNHARF TRGAYEGNHF ARSPDHMVSL GASMRWTGLG GRFDFRPTYT WRSKIFFADD NDRPELQAGL LVPDAGQDEF QNGFGLLNAR ISYAPERGSW EVEAFGSNLT DEIYRKGAGS AGKSIGLPTN VLGEPRVYGL RLTIHR