Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3498 |
Symbol | |
ID | 5900953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3773302 |
End bp | 3776184 |
Gene Length | 2883 bp |
Protein Length | 960 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641564004 |
Product | Phage-related protein tail component-like protein |
Protein accession | YP_001685123 |
Protein GI | 167647460 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAAAG TCATCATCAA GGGACCGCTA GCTAAGAAGC TCCCAGCCGC TTATCGCAAG TCGATCAGCT TTAAGGTCTC AACGATCCGC GAACTCCTGT GGGGGCTCGA TCAGTTCGGT TCGTTCTCCG CTGACCTAGA CGCTTTCCCG CACCAGTTCT TCGTCGGAAA GACGCTCAAG ACCGCTCACA GGTTGTCAGC CCAAGAAGCT GCCGCTGTGA AGCTGGACGA GCAAGAAGGC GTCTGCGTCT TCATCGTTCC ATGCGTCGCG GGTGCCGATC CCGTGACGAC AACGATGGTC GTTACAGCGC TTGTCAGTGC TGCAATCTCC ATCGGTATCA GCCTCATCAT GGCGGTTCTG TTTCCGCCGA CACAGGGAGG CAACGACACT CGAAAATCAG CCCTCTACGT CAACGGCATG AACACTCAAC AGGAAGGTAT TCCGATCCCT GTTGCGTATG GCACTGACAT CTTCTGCGGC TTCAACGTCA TCGAGGCCGA TGTCGAAGTC CTGAACACGG GCGGCATTTC GGGTTGGATG AACCCGCTAT CGGGTGCGCT CGCGGTCAAT GCTTCAGCGG GAACCGGCCT CTACGGATCG GGCGCTAACA ACTACGTCGA TCCCGCATGG GACATCCTGA AGGGCCTCGG CCTGAACGTG GATGGCTTGG GCGCAAAGGG CGGTGGCAAG ACCATCGCGA ACAACACCTT CTCCAACGCC TCCATGAAAA TCCTAGGCGC AGTCAGTGCG GGTCCAATCG GCGGATTGGT CGGTAACACG GTCGAAGAGC AGGAAAGCTC CATCTTCATC GACAAGACCG CCCTGCGTGA CGCAGCGAGC GGCAAGCTCT CCAAGCAGGG CTTCACATGG GCGACACGTC CAGGCGAGAC CGGCCAATCT GCCGTGCCTA TCACAGAGGC CATCGCAACG CCTTACGACG CTAGCCAGGA GCTTAAGAAG CTCACCAGTG CAGGCATCCC AACCGACATT CAGAAGACGT TCCCGAACGA CATCAACCGC TCGCGCTTCC GCTTCAACAT GGTCCTTCTT CAGACCAGCA AGAAGGGCAA CCAGAGCAAC GCGACTGTCG TGCTTGGTGC AGACTTCAAG CGCATCACAG ACACTTCATG GACTCCTTAT GCGAACTGGA CCTTCAACGG CAAAACCAGC ACGGGTGCCC AGCGCGAGAT TCAGGTGATT GCTCCGAATT GGAAGACCGA TGAGGAGTGG CAAGTTCGCA TCTATCGCGT CACCGAGGAC AGCACCGACG ACAAAATCCA GAACGCAACG ACGTTCAACG GCTGGGTCGA GATCATCGAC AAAGACCTAG CCTATGACGG TACGGCGGAC TCACCGCCTA CAGCGCTATT CGGTGCAAGC ATCGATGTCA GCCAGTTCGA CAGCGGCTCC AAGCCCGAGA TCATCCTACG CAGCGCAGGC CGAAAGGTCC GCGTTCCTCA TGTCGTTGGC TCCTATTGGG ATGGATCATG GGACACCAAA GTCACAGCCA ATCCGGTCTG GATTTGGTTC GACATGGCGA CAGACAAGCT CGTCGGCGGT GGCCTCTCCG ACACATGGTT CAACCGCTTT GAACTCTACG AGATCGCGCA GTTCTGCGAC CAACTCGTCA ATGGTCGTCC GCGCTTCACG CTGAACAAGC AATTCACCGA CAGCAAGGAA CTCTGGCAGC AGCTACGCGA AGTCGCGCAG TCCTTCATGG CCGTGGCCTA TTGGAACGGT TCGTCGGTGT CGCTTGTGCA GGACACGCCA AACGCGACGG TCAGCCACTA TATCACGAAC ACGATGGTTG AAGATGGCGC GTTTGCGTAC TCGTTCACCG ATCACATCGA GCGCTTCAAC GAAATCCTTG TCGAGTACGA CGACCCAACG CAGTTCGGCG CGAAGGGCAT TGCACCTTGG CAGGATGACG CGGCGATCAC TCGCGCCCGC GCGCTCAACT TCCCCAACGA TGGGAAGATC ACAAACACGA TCTACAAGAC CGGCTGCACC AACGCACAGG AGGCCTATGA CTGGGCTCGC CTGCTTGGGT ATGCCTCGCA GCGGGAAGTT AGGAACGTCG CGTTCGCCGC GCCTATCGCG GGCTCCACCT ACTTCCCTGG CCAGATCATC GAAATCGACG ACATGAACGT GTCGGGCAAA GAGCCCGTGG GTCGTGTCGC ACGCATCATC GACGCTGACC ATATCGAGCT AGACGCTCCC GTGACCTTGG AAGCGTTCAA GTCCTACACC CTACGCATCG TCGGTGAGAC CGTCCGTTCG ATCTCGCTGC CAATGCTCAC CGTGACCACG AAGAGCGCCA CTATCAACGC TCCTGCTCAC GGCGCGGTCG TTCAAGCTCC GGTCGGATTG ATCGAGCTTG GCACGGGTCC ACAGCCACAG CGCTTCCGCA TAATCGAAGT GAGCGAAGCC GGTCCCGCGA AGTACGAGGT CAAGGCGCAG CAGAGCATTA TCGGCAAGTG GGAAGAGGTC GAACAGCACG TCCCTGTTCC AATCCCAGAT TGGACAAACC GTGACACGTC GGTTCGCGCT CCAACGAACA TCGTGTTCAC GCCGCACGCG GCAGAAGACG ACATCACGGG CTCCTATACG TCACTCGAAA TCTCATGGAC CGGCCTGCCA ACCGGCGCGC TCGTTCGCGA GTACGTCCTT GAAGCAACCA TGCCCGATGG CGGTGTTTCA GTCGAACTCT ACAGAGGCGC AAACACGGGC TTTACGATCT CGCGTGCGCC TCCTGGCCTC TACGTCGTGT CGGTCAAAAC GATAAATATG CTTGGTGGCA GCAGCGAGCC TCTGTCTGGA AGCTACGTGC TTTCGACAGG CGATGAACTC GTCTTCCCAC CAGTCTTTAT AGGGTTTGAT TGA
|
Protein sequence | MVKVIIKGPL AKKLPAAYRK SISFKVSTIR ELLWGLDQFG SFSADLDAFP HQFFVGKTLK TAHRLSAQEA AAVKLDEQEG VCVFIVPCVA GADPVTTTMV VTALVSAAIS IGISLIMAVL FPPTQGGNDT RKSALYVNGM NTQQEGIPIP VAYGTDIFCG FNVIEADVEV LNTGGISGWM NPLSGALAVN ASAGTGLYGS GANNYVDPAW DILKGLGLNV DGLGAKGGGK TIANNTFSNA SMKILGAVSA GPIGGLVGNT VEEQESSIFI DKTALRDAAS GKLSKQGFTW ATRPGETGQS AVPITEAIAT PYDASQELKK LTSAGIPTDI QKTFPNDINR SRFRFNMVLL QTSKKGNQSN ATVVLGADFK RITDTSWTPY ANWTFNGKTS TGAQREIQVI APNWKTDEEW QVRIYRVTED STDDKIQNAT TFNGWVEIID KDLAYDGTAD SPPTALFGAS IDVSQFDSGS KPEIILRSAG RKVRVPHVVG SYWDGSWDTK VTANPVWIWF DMATDKLVGG GLSDTWFNRF ELYEIAQFCD QLVNGRPRFT LNKQFTDSKE LWQQLREVAQ SFMAVAYWNG SSVSLVQDTP NATVSHYITN TMVEDGAFAY SFTDHIERFN EILVEYDDPT QFGAKGIAPW QDDAAITRAR ALNFPNDGKI TNTIYKTGCT NAQEAYDWAR LLGYASQREV RNVAFAAPIA GSTYFPGQII EIDDMNVSGK EPVGRVARII DADHIELDAP VTLEAFKSYT LRIVGETVRS ISLPMLTVTT KSATINAPAH GAVVQAPVGL IELGTGPQPQ RFRIIEVSEA GPAKYEVKAQ QSIIGKWEEV EQHVPVPIPD WTNRDTSVRA PTNIVFTPHA AEDDITGSYT SLEISWTGLP TGALVREYVL EATMPDGGVS VELYRGANTG FTISRAPPGL YVVSVKTINM LGGSSEPLSG SYVLSTGDEL VFPPVFIGFD
|
| |