Gene Information |
Locus tag | Vapar_2474 |
Symbol | |
ID | 7969539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 2617139 |
End bp | 2618590 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644793057 |
Product | major capsid protein HK97 |
Protein accession | YP_002944366 |
Protein GI | 239815456 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
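
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. This is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2617139
end_bp = 2618590

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the remainder should be zero.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, codons, remainder, protein_length)
```

Under these assumptions the script reproduces the reported Gene Length of 1452 bp and Protein Length of 483 aa.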
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.690827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGC ATTCGCTGGC CCTGACCGCC TTGGCAGTCG CCGCAGCCTT CGGCCTCTCG GCCGCCGCCA TCGTGCGCAA CGATTCCGTT GCCACCCTCG ACACCCTGCA GAACCGGCTC ATCGAGCTCA AGGACGCGGG CAACAACATC CAGGCCCGCG CCGATGCCGA AAAGCGCGAC CTGACCGCTG ACGAGCAGGA AGAGATCAAG CAGATCTTCG CCTCGTTCGA AGCTGTCGAG GCCGACATCG AGCGCCGCGA ACAGCTGGAC GCGATGAACG CCAAGATCTC GCAGCCCGCC GGCCGCAAGA CCGCACCCGA GTTGCAGGAC GACGACGAGC CCGCACAGCC GCAGGCCCGC ACGACCGCCA AGCACAAGCC GATCTTCGCC ACCCCGCGCT CGGCCGACGC CAACAAGTGG GGTTTCCGCT CCCAAGCCGA GTTCTTCAAC GCCGTGGTGA AGTCGTCCGC CAAGGGTGCG CAGACCGACC CGCGCCTGAT CGCCAACGCG CCGACCACGT TCGGCTCGGA AGGCGTTGGC GCCGACGGTG GCTTCGCCGT GCCGCCGGAC TTCCGCAACA CCATCATTCA GAAGGTGATG GGTGAGGACT CGCTGCTGTC TCTGACCGAC CAACAAATCA GTTCGGGCAA CAGCATCACG TTCCCGGCCG ACGAAACCAC CCCGTGGCAG TCGAGCGGCG GCATCCAGGC CTACTGGGAA GTCGAAGGCG GCCAGAAGAC GCAATCGAAG CCTGCGCTGG TCGAGAAGAC CGTGAAGCTG AACAAGGTGA TCGCACTGGT GCCGCTCACC GACGAACTGC TGGAAGACGC GCCGGCCATG GCCAGCTACG TCAACCGCAA GGCGCCGGAG AAGATCGTGT TCAAGGTGAA CGACGCCATC ATCAACGGCA CCGGCGTTGG CATGCCCTTG GGTATCCTGA AGTCGCCCGG CACGGTGATC GTCGCCAAGG AAGGCAGCCA GACCGCCGAC ACGGTGGTTT TCGCCAACCT GACCAAGATG TGGACGTCCC TGACGCCAAT GGCACGCCGC AATGCGCGCT GGCTCATGAA CGCCGACGTC GAAGGCCAGC TGATGGGCAT GTCCTTCCCC GGCACCGGCA CCGCGGTTCC CGTCTACATG CCGCCTGGCG GCCTCTCGGC TGCCCCCTAC GGCACGCTGT TCGGTCGTCC GATCATGTAC TCGGAAGCCA TGCCGGCCCT GGGCGATGAA GGCGACATCC TGTTCGGCGA CCTGTCGAAC TACCTGTCGG GCGTGAAGGC CGGCGGCGTC AAGTCGGACG TGTCGATCCA CGTCTGGTTC GATTACGACA TCACCGCGTT CCGCTTCGTG CTGCGCGTCG GTGGCCAGCC GTGGTGGAAC GCTCCAGTCG CGCCGTACCA AGCCGGCGCA TCGAGCCGCG GCTTCTTCGC TGCCCTGGGC GCTCGCGCCT GA
|
Protein sequence | MKKHSLALTA LAVAAAFGLS AAAIVRNDSV ATLDTLQNRL IELKDAGNNI QARADAEKRD LTADEQEEIK QIFASFEAVE ADIERREQLD AMNAKISQPA GRKTAPELQD DDEPAQPQAR TTAKHKPIFA TPRSADANKW GFRSQAEFFN AVVKSSAKGA QTDPRLIANA PTTFGSEGVG ADGGFAVPPD FRNTIIQKVM GEDSLLSLTD QQISSGNSIT FPADETTPWQ SSGGIQAYWE VEGGQKTQSK PALVEKTVKL NKVIALVPLT DELLEDAPAM ASYVNRKAPE KIVFKVNDAI INGTGVGMPL GILKSPGTVI VAKEGSQTAD TVVFANLTKM WTSLTPMARR NARWLMNADV EGQLMGMSFP GTGTAVPVYM PPGGLSAAPY GTLFGRPIMY SEAMPALGDE GDILFGDLSN YLSGVKAGGV KSDVSIHVWF DYDITAFRFV LRVGGQPWWN APVAPYQAGA SSRGFFAALG ARA
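
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustrative sketch only: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares for codons within the reading frame (table 11 differs in the set of permitted start codons). Only the first three and the last two 10-base blocks of the gene sequence are used here, to keep the example short.

```python
# Standard codon assignments in TCAG order; shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Fragments copied from the gene sequence above.
first_blocks = "ATGAAGAAGC ATTCGCTGGC CCTGACCGCC"
last_blocks = "GCTCGCGCCT GA"

print(translate(first_blocks))  # compare with the start of the protein sequence
print(translate(last_blocks))   # compare with the end of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` should, under the same assumptions, allow a comparison against the Protein sequence and GC content fields of this record.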