Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_2472 |
Symbol | |
ID | 7969537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 2614754 |
End bp | 2616382 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644793055 |
Product | phage portal protein, HK97 family |
Protein accession | YP_002944364 |
Protein GI | 239815454 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTGA ACCCGCTCAC CGCAGTACGG CAGGCATTTG GCGCTCGCGC CGAAGCCACG CCCGAGCCGG GCGTCGTCGC CTTGCAGCGC ATCGCCATCG AAATGACAGG CGGCGGCCCG ACCGAAATCT ATGACGTCGT GCACGCCGGG CCGACGAATC CGAAGAGCAC CGCGTGGTGG CCTGGCCGCA GCAACGACGC CGGCGTTCAT GTGACGCATG GCATCGCCTT CATGCAAGCC GCGGTCTGGG CATGCATCGA CGCGATCGCT TCGGCCATCG CCTCTTCGGA CTGGAACGTC TACGCTGGTG TGCGCGGGGC CGACGACAAG AAGGTGATCC CGGAGGATCG CCTGCAATAC GTGCTGAACA CGCGGTTCAA TCCCGAGATG ACTGCGCAGG CCGGCAAGCG AGCCATGGGC ATCGGAGCCG CCGGCTACGG AAACGGCATC GCGGAAATCG AATGGGACAT GGCCGGCCGG CTGGCCTGGT TGTGGCCGAT CTCTCCCGAT CGCGTCGAGA TGGTGCGAAA CCAGTTCGGG CGGCTGGTCT ATAGGGTCAC GCAGGACTAC CAGGGCGGCT TCGTCGATCT GGAGCCGGAA GACGTCTTCC ACATCCGCGG CGCCAGCCTG ACCGGCCTTG CTGGTGATGA CATGGTGGCG AAGGCCATTA AGACGATCGC GCGCTCTGTG GCTGTCGATC AGTTCGCCTC GTCCTACTTC GCGAACGGGA CGCAGATGGG CGGCGTGCTC GAGTACCCGA ACAAGCTCGA TGACCCGACC TTCGAGCGCC TGAAAACGCA GATCAACGAC AAGCATCAGG GCGCGCGCAA CGCCTTCCGC ACGCTCTTTC TGGAGGCTGG CGGCAAGTTC ACCCAGTTCG CCGCAGACGC GGACAAGTCG CAGCTGGTCG ACGTCAAGAA CCAGCTGATC GAAGAGGTGT GCCGCTGGTT CCGGGTGCCT CCGCACAAGA TCGCGCACCT GTTGCGCGCC ACGAACAACA ACATAGAGCA CCAGGGCTTG GAGTTCACGC GCGACACGCT GCGCCCCTGG GTCAAGGAGA TCGAGCAGGA AGCCGACTTC AAGTTGTTTT CGACCCGCGG CCCGAAGAAG TTTGTCGAGC TCGACGTCGA TTGGGCAGAG CAGGGCGATT ACGGCAGCCG GATGACGGCG TACAGCACCG CGCGCGGCAT GGGCGTCTTC AGTGTCAACG ACGTGCTGCG CAAGCTCGGA GAGAACACGA TCGGCCCCGT TGGCGACGTC CGGACCATGA ACGGCGCCGC GGTGCGCTTG GAGGACGTCG GCAAGAACAT GATGCCAGCA CCGGCGCCAG CCGCCACGCC GGCCCCAGCG CCCGCGGCGA ACGCCGGCCA GGCCGGCGAT GTGGCGCAGG CTTGGCTCAC GTCGGTCTAC GCGCGCATTC AGCGGCGTTT CGACAACCGG AAGGATGCCG CGGGCGCCTC TGCCGCGCTG CAGGACGCCA AGATTTACGC GGCAGAACAG GTGGCAGAGA TGGCGGAGGC GCTCGGCGAC CGCATCGAAG CCGCACAGGC CAAAGCGATC GAGATGGTGG GCAGCCCATT GCTCCCCGCG GACGCCGCGG CGGAAGTATT CAAGAAGGAA CCGGCATGA
|
Protein sequence | MALNPLTAVR QAFGARAEAT PEPGVVALQR IAIEMTGGGP TEIYDVVHAG PTNPKSTAWW PGRSNDAGVH VTHGIAFMQA AVWACIDAIA SAIASSDWNV YAGVRGADDK KVIPEDRLQY VLNTRFNPEM TAQAGKRAMG IGAAGYGNGI AEIEWDMAGR LAWLWPISPD RVEMVRNQFG RLVYRVTQDY QGGFVDLEPE DVFHIRGASL TGLAGDDMVA KAIKTIARSV AVDQFASSYF ANGTQMGGVL EYPNKLDDPT FERLKTQIND KHQGARNAFR TLFLEAGGKF TQFAADADKS QLVDVKNQLI EEVCRWFRVP PHKIAHLLRA TNNNIEHQGL EFTRDTLRPW VKEIEQEADF KLFSTRGPKK FVELDVDWAE QGDYGSRMTA YSTARGMGVF SVNDVLRKLG ENTIGPVGDV RTMNGAAVRL EDVGKNMMPA PAPAATPAPA PAANAGQAGD VAQAWLTSVY ARIQRRFDNR KDAAGASAAL QDAKIYAAEQ VAEMAEALGD RIEAAQAKAI EMVGSPLLPA DAAAEVFKKE PA
|
| |