Gene Information |
Locus tag | Cla_0852 |
Symbol | |
ID | 7410432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Campylobacter lari RM2100 |
Kingdom | Bacteria |
Replicon accession | NC_012039 |
Strand | - |
Start bp | 807382 |
End bp | 808542 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643717982 |
Product | major capsid protein, HK97 family |
Protein accession | YP_002575429 |
Protein GI | 222823855 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
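
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 807382
END_BP = 808542
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # 1161 386
```

Both computed values agree with the Gene Length and Protein Length rows.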
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.0590991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAA TAAGAGAAGA AATAGGAGCT TTACATAAGC AGATGTTAGA GCTTTCTAAT AAAGCAAAAA ATGAAAAAAG AGCATTTAGC ACAGATGAAG AACAAAAATA TAATGAAATG CTTAAAGATT TTGAAAGCAA ACGCAGCGAG CTTTTAAGAC TTGAAGAAGA GCAAAAAAGA GAAAAATTTT TAAATGAAGT TACAAGTATA AGATTAGAAC AGAATCCACA TCCACAAAAA GAAGAAGAAA GAGACTCTAT GAGATCTTTT GTTAAATACT TAAGAAGTGG AGTTATAGAC ACATCTTTGC AAAGAGATGC ATTAAACGAA GCTTCAGGTG ATAAAGGCGG TGTTTTAGTT CCTACAACAT TGCAATCAAA AATTAGATCA AAATTGACAG ATCTTAGTGT AATTAGGAAA ATAGCTACTG TGCAAAAATC ATCCACAAAT CAAGATATAC CTATACTTGA AGATATAAGT GGTTTTGGTT GGATTGATGA AATGGCTAGT TTTAATGAGG CAAGCGCTTC TTTTTCAAAA TTAACTATTG GAGCATACAA ATTGGGAGGT ATTATTAAAA TTAGTGAAGA ATTACTAAAT GATAATATCT CAAATTTAGA AAGTTTTTTA ATTAGAAAGA GTGCCGAAAA AATTGCACAA GCAGAGGAAG AAGCATTTAT AAAAGGCGAT GGAAATAAAA AACCAACAGG TATTGTAAAT ACAAAAACAA AATATGAATT AGCAAGTAAT AGTGGCATTA CAAGCAATGA TGTAATTGAT GCATTTTTTA CATTAAAAAG TGCATATAGA TCTAATGCAT GCTGGCTTGT TGGGGATGAT TTTATGAAAG CTCTTTATAA ATTAGTCGAT GGAGATGGTA GACCTTTATG GATGCCAGCG CTTAGTTCTG GTGGATATGA CACTATTTTA GGTAAAAAAG TAATCTATTG TTCTTCATTA GATGGTTTTG GTGCTAATAA AATTCCTGCT ATCTTCGGTG ATTTTAGTTT TTATGAAATT TGGGATAGAG AAACTATGAG CTTTACGAGA TTAAATGAAT TGTATGCTCA AAATGATTTA GTTGGAATTA AAGTTAGATC AAGACTTGAT GCTAAATTAA TCACAGATGA AGCAGTTTGT AAAATAGTAA CTCCTTCTTA A
|
Protein sequence | MQKIREEIGA LHKQMLELSN KAKNEKRAFS TDEEQKYNEM LKDFESKRSE LLRLEEEQKR EKFLNEVTSI RLEQNPHPQK EEERDSMRSF VKYLRSGVID TSLQRDALNE ASGDKGGVLV PTTLQSKIRS KLTDLSVIRK IATVQKSSTN QDIPILEDIS GFGWIDEMAS FNEASASFSK LTIGAYKLGG IIKISEELLN DNISNLESFL IRKSAEKIAQ AEEEAFIKGD GNKKPTGIVN TKTKYELASN SGITSNDVID AFFTLKSAYR SNACWLVGDD FMKALYKLVD GDGRPLWMPA LSSGGYDTIL GKKVIYCSSL DGFGANKIPA IFGDFSFYEI WDRETMSFTR LNELYAQNDL VGIKVRSRLD AKLITDEAVC KIVTPS
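
The relationship between the two sequence rows can be illustrated with a short sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TAA), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in the permitted start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first 30 bases of the gene are used here.

```python
from itertools import product

# Compact codon table in TCAG order. Table 11 shares these amino-acid
# assignments with the standard code; alternative start codons are not handled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence above.
first_bases = "ATGCAAAAAA TAAGAGAAGA AATAGGAGCT"
print(translate(first_bases))  # MQKIREEIGA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content row.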