Gene Information |
Locus tag | Caci_2854 |
Symbol | |
ID | 8334203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 3257843 |
End bp | 3261151 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644955998 |
Product | portal protein |
Protein accession | YP_003113604 |
Protein GI | 256392040 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
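The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the final codon
    is a stop codon, which is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 3257843, End bp 3261151
print(cds_lengths(3257843, 3261151))  # (3309, 1102)
```

Under these assumptions the result agrees with the reported Gene Length (3309 bp) and Protein Length (1102 aa).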
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0279665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00446406 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGTCCGGT CGATGTTCAA CGTCGTCGGC AGCCTACTCA ACAAGAACGC CAACGCCTCA CCGGTCCCCT ACGCCGCACC AGGCCGCTAC ATCATCCCCG CACTGTCCGG ACGCTCGGAC AACGAGGTCT ACATGCGGGC GATGGGGACC GCGGGCACGA TCTTCCAAAT CGTGTCGCTG CTGTCCTCCG CCAGCGCCAC CCCACAGTGG CGGCTGTACC GCAAGCCGAA AGCCGACGGC AGACAGCGGT ACACCACCGG CGACCGCGGA TCCGACCAGC GCACCGAAGT GCTGCAGCAC CAGGCGCTGA ACGTCTGGAA CAACCCGAAC CCGTTCACCA CCGGCGTCAA CTTCCGCGAA GCCGGCTGGC AGCACATGGA GCTGACCGGC GAACAGTGGT GGGTTGTCGT CCGCGACTCC CGAGCCACCT TCCCGACCGG GCTGTGGCTG GTCCGCCCGG ACCGCATGGA GCCGGTGCCG TCGGCGGAGA AGTACATCGC CGGGTACGTC TACACCGGAC CGTCGGGCGA GCGGGTTCCG CTGCAGCCCG AAGAGGTCAT CCTCACCAAA TACCCGAACC CCCTGGACCC ATACCGCGGG CTCGGGCCGG TGCAGTCGGT GCTGGTCGAC ATCGACGCCA TGAAGTACGG CAGCGAGTGG AACCGGAACT TCTTCATCAA CGGCGCTGTC CCCGGCGGCG TCGTCACTGT GCCGGGAAAC ATGTCGGACG ACGAGTTCGA CCAGTTTTCG ACCCGGTGGC GCGAATCTCA CCAGGGCGTG TCCCGGTCCC ACCGCGTTGC GATCCTCGAG GGTGGCGCGA CCTGGGTTCC GACCCAGATG TCCATCAAGG ACATGGACTT CTCCAACCTG CGGAACGTCA GCAGGGATGT CCTGCGCGAG GCCTGGGGCA TCCACAAGTC GATGCTCGGC AACGCTGACG ATGTCAACCG CTGCCACGAC GACCAGACCG AGGTCCTCAC CAACGGCGGC TGGAAGCCGT TCGAGAAGGT CGAAGACGCC GACCTCATCG CCACCGTTAA CCCCGCGACG CGCCGCATTG AATACCACGC GCCGATCCAC CGCTTCGCCT ACGACTACCA CGGCGACATG GTTCGGATCG TCAACGACGT CATCGACGTG AAGGTCACCC CCAACCACAC GATGCTGTAC GCGACACCGC GGTTGCCGGA CGTTTGGCGT ACATGCCGCG CGGACCAACT CCCGTCCCGG TTCCTAGTTC AGACTGCCCC GGACGCCGAG GACTGCGCGG ACCAGCAGTG GTTTGACCTG CCCGCCGTCG AATACGACAA CGGGCACTAC GGTCGAGGCG GATCTGAACG CCTGCCGATG GATGCGTGGC TGGAGTTCCT GGGCTGGGTT ATCTCCGAGG GCGGCATCCT GTCCGAGGAG CGTGCTGGCA ACCGCTACGT AATGACGCTG GCGCAGAAGA AGTACCCGCA GCGGATCCGT GACTGTCTCG CGCAGCTTCC GTTCTCTGCC TATGAGTACT TCGACGAGGG CGGTCAGATC GCCCGGTGGA ACATCACCGG CAAGGGTCTG ATCACCTGGC TGCGGGAGCA CGTAGGCACC AACTGCCAGG ACAAGCGCAT CCCGCGGTGG TGCTTCAACC TGTCGCTCCG GCAGCGGCGC ATCCTGTTCG ACGCGATGAT GGCTGGCGAC GGCAGTCGCG ATCCACGGCC GGGCCGCTCG AACTGCTACT ACGCGACTAG CTCCCCGGGC CTTGCCGATG ACGTGCTGGA GTTCGTCTTC CGTCTCGGCT GCCGCGCCAA CATGGCGTCA 
CACCATGACG GACGCAGCGC TCGCGCGCCG ATGTACTACA TCCACATCAC CGACCGGCCG GCGTCCTGGG TGACGGGATC CAACAACGTT TCGACCGAAC GTTACGACGG CAAGGTCTAC AGCCTTGAGG TGCCGAACCA CATCTACGTG ACCCGGCGCA ACGGGCGCAT CGCGATACAC GGCAACTCCA ACGCCCAAAC CGCCGAAGAA GTCTTCGGAC GCTGGAAGAT CATCCCCCGG CTTGACCGGC TCCGCGACAC TCTCAACAAC CAGTTCCTGC CGATGTTCGG CAGCACCGGC GAGAACGTCG AATGGGACTA CGTCAACCCG CTGCCGGACG ACCGCGAAGC CGACAACGCC GAGCTCACCG CGAAGACCAC GGCCTACGCC GCGCTCATCA GCGCAAACGT CAACCCGGAC GACGCTGCCG AAGTGGTCGG CCTGCCGCGG ATGCGCCACA TCATTCAGGT CCCGGCCGCA ACCCCCACTA CCGGCGCCGG CGAACTTGAG CAGGCCGACG ACAGCAGCGG CGAGCCTACC GAGGACGACA ATCCCGTCGA GAACCAGGTG CGCAGTTCCA TCCACGCCCG CACCCTCACC GCGATTACCG CAGCCACAGG CGATTCGGGG CCGACGCCGC CAGCGCAGGG CCCGAACCAG CCGGACCTGT CCCGCGTCGA TGCACAGTGG AAGTCGGCCA CCGACAATGT CGTCGCCGAC TACCAGAGCC AGATCCTGCC CGCCCAGCAG CAGCAGTTGC AGCAGCAGAT CCGCGAACAC GTGGACGCTG CCGAACTAGC CGCGCTCGGA ACGCTGGCCG TCGACACGGT CGCCGCGAAG GCGTTGCTGC TTGCCGCGAT GGTCGCGTTC GCCGTGGTTG CCGCTCGGGA AGCCAGCCGC GAAGCCGACG AGCAAGGCGC CAGCGTGCCG CCGAAGACGC CGGACTCCTC CGAGCTCGAA ACGATCGCCG CAGTAGCTGT CGCCCTTATG GCTTCCGAGC TTGCGGTATC GGCCGGGCGA GAAGCGATGC GGCTTGCCGT TCCCGGAGTT GGCGGGCAGC AGGTTGCGGA CCAGGTTGGG CAGTTCCTGC AAGGCCTGTC CGAAGCCGGC CCCCGCGGGC ACCTGGCCGG AGCGATGTCG GCGGCACAGA ACCGGGCCCG GCACGCCACA TTCACGCAGC ACGGCGCCCC GCGTGGGCAG CTCCGGGCCG TAGAAGTCTT GGACTCGGCA ACCTGCGACA ACTGCGAAGA GATCGACGGG ACCGTCTTCG GCGACACCGA TGACCCGGAC GCCGTGGCCG AGGCGTTCGC CGCGTACCCC ATGGGTGGCT ACACGCTTTG CGAGGGGCGC GAACGTTGCC GCGGAACCCT TTTCGTCGAC TACGACGTCG GGCCTGACTC CGATAACCCC GGTGACGACC TGACTGCAAT GTTGAAGAAG CTCGTCAACC TTCTCGAGCC GGAACCGGCG CACATTAACG GACACCACCA CGAGAAGGCG AGGACGTGA
|
Protein sequence | MVRSMFNVVG SLLNKNANAS PVPYAAPGRY IIPALSGRSD NEVYMRAMGT AGTIFQIVSL LSSASATPQW RLYRKPKADG RQRYTTGDRG SDQRTEVLQH QALNVWNNPN PFTTGVNFRE AGWQHMELTG EQWWVVVRDS RATFPTGLWL VRPDRMEPVP SAEKYIAGYV YTGPSGERVP LQPEEVILTK YPNPLDPYRG LGPVQSVLVD IDAMKYGSEW NRNFFINGAV PGGVVTVPGN MSDDEFDQFS TRWRESHQGV SRSHRVAILE GGATWVPTQM SIKDMDFSNL RNVSRDVLRE AWGIHKSMLG NADDVNRCHD DQTEVLTNGG WKPFEKVEDA DLIATVNPAT RRIEYHAPIH RFAYDYHGDM VRIVNDVIDV KVTPNHTMLY ATPRLPDVWR TCRADQLPSR FLVQTAPDAE DCADQQWFDL PAVEYDNGHY GRGGSERLPM DAWLEFLGWV ISEGGILSEE RAGNRYVMTL AQKKYPQRIR DCLAQLPFSA YEYFDEGGQI ARWNITGKGL ITWLREHVGT NCQDKRIPRW CFNLSLRQRR ILFDAMMAGD GSRDPRPGRS NCYYATSSPG LADDVLEFVF RLGCRANMAS HHDGRSARAP MYYIHITDRP ASWVTGSNNV STERYDGKVY SLEVPNHIYV TRRNGRIAIH GNSNAQTAEE VFGRWKIIPR LDRLRDTLNN QFLPMFGSTG ENVEWDYVNP LPDDREADNA ELTAKTTAYA ALISANVNPD DAAEVVGLPR MRHIIQVPAA TPTTGAGELE QADDSSGEPT EDDNPVENQV RSSIHARTLT AITAATGDSG PTPPAQGPNQ PDLSRVDAQW KSATDNVVAD YQSQILPAQQ QQLQQQIREH VDAAELAALG TLAVDTVAAK ALLLAAMVAF AVVAAREASR EADEQGASVP PKTPDSSELE TIAAVAVALM ASELAVSAGR EAMRLAVPGV GGQQVADQVG QFLQGLSEAG PRGHLAGAMS AAQNRARHAT FTQHGAPRGQ LRAVEVLDSA TCDNCEEIDG TVFGDTDDPD AVAEAFAAYP MGGYTLCEGR ERCRGTLFVD YDVGPDSDNP GDDLTAMLKK LVNLLEPEPA HINGHHHEKA RT
|
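The relationship between the gene and protein sequences above can be illustrated with a small translation routine. This is a sketch only, not the tool that produced the record. It assumes the gene sequence is shown in coding orientation (it starts with GTG and ends with TGA), builds the codon table in plain Python rather than using a library, and treats ATG, GTG and TTG as start codons read as methionine, which is a subset of the initiators allowed by translation table 11. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code and
# differs in which codons may act as starts.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, ignoring the spaces used to group bases."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First six and last three codons of the gene sequence in the record
print(translate_cds("GTGGTCCGGT CGATGTTC"))  # MVRSMF
print(translate_cds("AGGACGTGA"))            # RT
```

The two fragments match the beginning (MVRSMF) and end (RT) of the protein sequence listed in the record.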