Gene Vapar_2472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_2472 
Symbol 
ID7969537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp2614754 
End bp2616382 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content65% 
IMG OID644793055 
Productphage portal protein, HK97 family 
Protein accessionYP_002944364 
Protein GI239815454 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTGA ACCCGCTCAC CGCAGTACGG CAGGCATTTG GCGCTCGCGC CGAAGCCACG 
CCCGAGCCGG GCGTCGTCGC CTTGCAGCGC ATCGCCATCG AAATGACAGG CGGCGGCCCG
ACCGAAATCT ATGACGTCGT GCACGCCGGG CCGACGAATC CGAAGAGCAC CGCGTGGTGG
CCTGGCCGCA GCAACGACGC CGGCGTTCAT GTGACGCATG GCATCGCCTT CATGCAAGCC
GCGGTCTGGG CATGCATCGA CGCGATCGCT TCGGCCATCG CCTCTTCGGA CTGGAACGTC
TACGCTGGTG TGCGCGGGGC CGACGACAAG AAGGTGATCC CGGAGGATCG CCTGCAATAC
GTGCTGAACA CGCGGTTCAA TCCCGAGATG ACTGCGCAGG CCGGCAAGCG AGCCATGGGC
ATCGGAGCCG CCGGCTACGG AAACGGCATC GCGGAAATCG AATGGGACAT GGCCGGCCGG
CTGGCCTGGT TGTGGCCGAT CTCTCCCGAT CGCGTCGAGA TGGTGCGAAA CCAGTTCGGG
CGGCTGGTCT ATAGGGTCAC GCAGGACTAC CAGGGCGGCT TCGTCGATCT GGAGCCGGAA
GACGTCTTCC ACATCCGCGG CGCCAGCCTG ACCGGCCTTG CTGGTGATGA CATGGTGGCG
AAGGCCATTA AGACGATCGC GCGCTCTGTG GCTGTCGATC AGTTCGCCTC GTCCTACTTC
GCGAACGGGA CGCAGATGGG CGGCGTGCTC GAGTACCCGA ACAAGCTCGA TGACCCGACC
TTCGAGCGCC TGAAAACGCA GATCAACGAC AAGCATCAGG GCGCGCGCAA CGCCTTCCGC
ACGCTCTTTC TGGAGGCTGG CGGCAAGTTC ACCCAGTTCG CCGCAGACGC GGACAAGTCG
CAGCTGGTCG ACGTCAAGAA CCAGCTGATC GAAGAGGTGT GCCGCTGGTT CCGGGTGCCT
CCGCACAAGA TCGCGCACCT GTTGCGCGCC ACGAACAACA ACATAGAGCA CCAGGGCTTG
GAGTTCACGC GCGACACGCT GCGCCCCTGG GTCAAGGAGA TCGAGCAGGA AGCCGACTTC
AAGTTGTTTT CGACCCGCGG CCCGAAGAAG TTTGTCGAGC TCGACGTCGA TTGGGCAGAG
CAGGGCGATT ACGGCAGCCG GATGACGGCG TACAGCACCG CGCGCGGCAT GGGCGTCTTC
AGTGTCAACG ACGTGCTGCG CAAGCTCGGA GAGAACACGA TCGGCCCCGT TGGCGACGTC
CGGACCATGA ACGGCGCCGC GGTGCGCTTG GAGGACGTCG GCAAGAACAT GATGCCAGCA
CCGGCGCCAG CCGCCACGCC GGCCCCAGCG CCCGCGGCGA ACGCCGGCCA GGCCGGCGAT
GTGGCGCAGG CTTGGCTCAC GTCGGTCTAC GCGCGCATTC AGCGGCGTTT CGACAACCGG
AAGGATGCCG CGGGCGCCTC TGCCGCGCTG CAGGACGCCA AGATTTACGC GGCAGAACAG
GTGGCAGAGA TGGCGGAGGC GCTCGGCGAC CGCATCGAAG CCGCACAGGC CAAAGCGATC
GAGATGGTGG GCAGCCCATT GCTCCCCGCG GACGCCGCGG CGGAAGTATT CAAGAAGGAA
CCGGCATGA
 
Protein sequence
MALNPLTAVR QAFGARAEAT PEPGVVALQR IAIEMTGGGP TEIYDVVHAG PTNPKSTAWW 
PGRSNDAGVH VTHGIAFMQA AVWACIDAIA SAIASSDWNV YAGVRGADDK KVIPEDRLQY
VLNTRFNPEM TAQAGKRAMG IGAAGYGNGI AEIEWDMAGR LAWLWPISPD RVEMVRNQFG
RLVYRVTQDY QGGFVDLEPE DVFHIRGASL TGLAGDDMVA KAIKTIARSV AVDQFASSYF
ANGTQMGGVL EYPNKLDDPT FERLKTQIND KHQGARNAFR TLFLEAGGKF TQFAADADKS
QLVDVKNQLI EEVCRWFRVP PHKIAHLLRA TNNNIEHQGL EFTRDTLRPW VKEIEQEADF
KLFSTRGPKK FVELDVDWAE QGDYGSRMTA YSTARGMGVF SVNDVLRKLG ENTIGPVGDV
RTMNGAAVRL EDVGKNMMPA PAPAATPAPA PAANAGQAGD VAQAWLTSVY ARIQRRFDNR
KDAAGASAAL QDAKIYAAEQ VAEMAEALGD RIEAAQAKAI EMVGSPLLPA DAAAEVFKKE
PA