Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3486 |
Symbol | |
ID | 5900941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3764291 |
End bp | 3765475 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641563992 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001685111 |
Protein GI | 167647448 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.168874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTATTG AAAAACTAGA TGGTCTGATC GCAGAAATGC GCCTGAAGGC CGATGCAACT GACGTAGTTG CAAAGGGTCA ACTGGACGAC CTAAGCGCTC ACGTTAAGTC GGAAATCGCG GCAATCCGCT CCGACATGGC TGCACAAAAT GCACCACGCG AAGACGCACA AGACACCGCT GAAATCCACC AAAAGGCCGT TGCCGAATTC ATGGTCCACG GCTCCAAGAT GGTCGAAGCC GGTAAGATCA AGGCCACTGA CTACGCAGTC AGCGTCTCGG CCGATGGCGG CATCACCGTC CCAACCAACA TCCACACCGA CGTTGTCGAA AAGCTCCGTA AGTCGAGCCC ACTGCTCGGC CTATCGACCG TGACCTCGCT GAAGGGTCTG CAAAAGCTCC TGCTGAAGAC CAGCGCTGCT CAAGCTAACA CCCGTTCAGA GCGCGGCGCG TTCACCGACA ACACCGTTGA AGGCTTCGCA GGTATCACTC TGGGCTCGAA GGACGTTTAC GACCGTCAAG CCCACACTGT TGAATCGGTC GAAGGCGACT CCGTTCTGGA CTTCCAGCAA GTCCTACTGG CTTCGATGAA TGCCGGTATC GCTGAGAAGA TGGCGAGCGA ACTGCTGAAC AGCACCGTGA CCAACGCCGT TGAAAACGTC GCAGGCACAA CCACCATCCC AGCCGGTATC CTGAACCGCG TGACCGAAGT TGCGGCTGAC CGCTTCACCG GTAGCATCGG CAAGACCCCT TGCCTGACCA CTGCCGGTAT CAACCTCGTC ACTTTCGAGG ACATGATTAA GCTCTACAAC AGCCTGCACA CCAAGTACCG CAACTCGGCG ACCTTCGTCA CCGGCAGCGA TATTGAGCTG CAACTGATGC TCGTGAAGGA CTCGACCGGC AACTACATCT GGCAACCAGC CGTGGGTCAG ACCTACCAAG CTGTTATCCA GGGCCGTCCT GTCGTCATCG ACGACTTCAT GCCCGGCGTG ACCGGTTCGG CAGGCGTTCC TCGCGTTCTG TTCGCCGACT TCTCGGAAAA CTACGTCAAC TTCTCGGGTC CAATGCAGTG GGTCGTCGAC CCATACACCG CACCTGGCTA CGTGAAGTAC ACCGCGCGTC AGCGTTTCGG TTCGGTCTTC CGCGACAGCC AAGCAATCCG TGGTCTGCTC CTAAAGACCT CGTAA
|
Protein sequence | MSIEKLDGLI AEMRLKADAT DVVAKGQLDD LSAHVKSEIA AIRSDMAAQN APREDAQDTA EIHQKAVAEF MVHGSKMVEA GKIKATDYAV SVSADGGITV PTNIHTDVVE KLRKSSPLLG LSTVTSLKGL QKLLLKTSAA QANTRSERGA FTDNTVEGFA GITLGSKDVY DRQAHTVESV EGDSVLDFQQ VLLASMNAGI AEKMASELLN STVTNAVENV AGTTTIPAGI LNRVTEVAAD RFTGSIGKTP CLTTAGINLV TFEDMIKLYN SLHTKYRNSA TFVTGSDIEL QLMLVKDSTG NYIWQPAVGQ TYQAVIQGRP VVIDDFMPGV TGSAGVPRVL FADFSENYVN FSGPMQWVVD PYTAPGYVKY TARQRFGSVF RDSQAIRGLL LKTS
|
| |