Gene Vapar_2474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_2474 
Symbol 
ID7969539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp2617139 
End bp2618590 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content66% 
IMG OID644793057 
Productmajor capsid protein HK97 
Protein accessionYP_002944366 
Protein GI239815456 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.690827 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGC ATTCGCTGGC CCTGACCGCC TTGGCAGTCG CCGCAGCCTT CGGCCTCTCG 
GCCGCCGCCA TCGTGCGCAA CGATTCCGTT GCCACCCTCG ACACCCTGCA GAACCGGCTC
ATCGAGCTCA AGGACGCGGG CAACAACATC CAGGCCCGCG CCGATGCCGA AAAGCGCGAC
CTGACCGCTG ACGAGCAGGA AGAGATCAAG CAGATCTTCG CCTCGTTCGA AGCTGTCGAG
GCCGACATCG AGCGCCGCGA ACAGCTGGAC GCGATGAACG CCAAGATCTC GCAGCCCGCC
GGCCGCAAGA CCGCACCCGA GTTGCAGGAC GACGACGAGC CCGCACAGCC GCAGGCCCGC
ACGACCGCCA AGCACAAGCC GATCTTCGCC ACCCCGCGCT CGGCCGACGC CAACAAGTGG
GGTTTCCGCT CCCAAGCCGA GTTCTTCAAC GCCGTGGTGA AGTCGTCCGC CAAGGGTGCG
CAGACCGACC CGCGCCTGAT CGCCAACGCG CCGACCACGT TCGGCTCGGA AGGCGTTGGC
GCCGACGGTG GCTTCGCCGT GCCGCCGGAC TTCCGCAACA CCATCATTCA GAAGGTGATG
GGTGAGGACT CGCTGCTGTC TCTGACCGAC CAACAAATCA GTTCGGGCAA CAGCATCACG
TTCCCGGCCG ACGAAACCAC CCCGTGGCAG TCGAGCGGCG GCATCCAGGC CTACTGGGAA
GTCGAAGGCG GCCAGAAGAC GCAATCGAAG CCTGCGCTGG TCGAGAAGAC CGTGAAGCTG
AACAAGGTGA TCGCACTGGT GCCGCTCACC GACGAACTGC TGGAAGACGC GCCGGCCATG
GCCAGCTACG TCAACCGCAA GGCGCCGGAG AAGATCGTGT TCAAGGTGAA CGACGCCATC
ATCAACGGCA CCGGCGTTGG CATGCCCTTG GGTATCCTGA AGTCGCCCGG CACGGTGATC
GTCGCCAAGG AAGGCAGCCA GACCGCCGAC ACGGTGGTTT TCGCCAACCT GACCAAGATG
TGGACGTCCC TGACGCCAAT GGCACGCCGC AATGCGCGCT GGCTCATGAA CGCCGACGTC
GAAGGCCAGC TGATGGGCAT GTCCTTCCCC GGCACCGGCA CCGCGGTTCC CGTCTACATG
CCGCCTGGCG GCCTCTCGGC TGCCCCCTAC GGCACGCTGT TCGGTCGTCC GATCATGTAC
TCGGAAGCCA TGCCGGCCCT GGGCGATGAA GGCGACATCC TGTTCGGCGA CCTGTCGAAC
TACCTGTCGG GCGTGAAGGC CGGCGGCGTC AAGTCGGACG TGTCGATCCA CGTCTGGTTC
GATTACGACA TCACCGCGTT CCGCTTCGTG CTGCGCGTCG GTGGCCAGCC GTGGTGGAAC
GCTCCAGTCG CGCCGTACCA AGCCGGCGCA TCGAGCCGCG GCTTCTTCGC TGCCCTGGGC
GCTCGCGCCT GA
 
Protein sequence
MKKHSLALTA LAVAAAFGLS AAAIVRNDSV ATLDTLQNRL IELKDAGNNI QARADAEKRD 
LTADEQEEIK QIFASFEAVE ADIERREQLD AMNAKISQPA GRKTAPELQD DDEPAQPQAR
TTAKHKPIFA TPRSADANKW GFRSQAEFFN AVVKSSAKGA QTDPRLIANA PTTFGSEGVG
ADGGFAVPPD FRNTIIQKVM GEDSLLSLTD QQISSGNSIT FPADETTPWQ SSGGIQAYWE
VEGGQKTQSK PALVEKTVKL NKVIALVPLT DELLEDAPAM ASYVNRKAPE KIVFKVNDAI
INGTGVGMPL GILKSPGTVI VAKEGSQTAD TVVFANLTKM WTSLTPMARR NARWLMNADV
EGQLMGMSFP GTGTAVPVYM PPGGLSAAPY GTLFGRPIMY SEAMPALGDE GDILFGDLSN
YLSGVKAGGV KSDVSIHVWF DYDITAFRFV LRVGGQPWWN APVAPYQAGA SSRGFFAALG
ARA