Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_6186 |
Symbol | |
ID | 7381241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 1192103 |
End bp | 1193518 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643649663 |
Product | phage major capsid protein HK97 family |
Protein accession | YP_002547887 |
Protein GI | 222107096 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.423125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTATGA GATTCAACCC TGCCATCGCG CTCGGTCTCG TTCTTCTCTC CGCCCTCCTG TTTCTTCTGG TCCTGGGTGG TGGTGCGCAT GCCGCCGCGA TCCTCCCTCT GGATGCGCAT GTCACCGGCT TTGCCATGGT GATGACGGTT GCGCCGCAAC TCAATCATCG TGCGCGTGGT CTGGTGATTG GTGTGCGCAA CGAAGGCGAT GCGCAAAAGA TTTTTGAAGA GCTGAAAAAG AGCGTGGAAG CGTTCAAGGC TTCCCATGAG GAAGAATTGA AGGGCATCAA GAATAAGTTC GCCGATGTCG TGACGACTGA GAAGGTCGAT AAGATCAACA ACGAAATCAC TTCGTTGCAA AAGGCGCTGG ATGATGTCAA CGCCATGATG GCCGCCTCCA AGCTTGGCGG TGCTGGCGAT GGCGTGGAAA GTGCTGATCA GCGCGAACAT CGCGGCGCGT TCAACAAGTG GTTCCGCAAA GGCGCGGACG CTGGCCTTGC AGACCTTGAA GTCAAGGCGG CCTTGACGAC GCAATCTGAC CCGGACGGCG GCTTTCTGGT GCCGACGCAA ACAGAAACCA CGATTGACCG GGTTCTCGGC ACTGTCAGCA CCATGCGGCA ACTTGCGACC GTTATGCCGG TTGGCACGGA CACCTATACC AAGTTCGTCA ACATGGGTGG GTCTGGCGCC GGTTGGGTCG GCGAGGAAGA GTCCCGCCCG GAAACCGGCA CTCCGACCCT GCGTGAAATC GTGCTGACGG TCATGGAGCT TTATGCCAAT CCGTTCACCA CGCAAAAGAT GTTGGATGAT GGCATTATCG ACATCGCCAC CTGGCTTGCC GATGAAGTCA GCATTACATT TGCTGAAAAG GAAGGCGCCG CCTTCATCAG CGGCGATGGC GTCAAAAAAC CACGCGGCAT CCTGGCTTAC GATACCATTG CCAATGCAAG CTACGCCTGG GGAAGTCTTG GCTTCGTCGT TTCTGGCGGT GCAAGTGGGT TTGCATCTTC CGCGCCTGCC GACGCCTTCA TTGATCTGTA TTATGGCCTG AAAGCCGGAT ACCGGACCAA CGCATCATTC TTGACATCTG ATGCGACGAT GGGGTCGATC CGTAAAATGA AGGACGGTCA GGGGAATTAC CTGTGGCGCG ACCCTTCCGC TCCTGGTGAA GTTCCGACAA TCCTCGGCAA GCCAGTTTAC ACCGACGACA ACATGCCTGC CGTGGCAGCC AATGCGTTCC CTGTTGCCTT TGGTGACTTC AAGCGCGGCT ATATGGTTGC GGACCGCACT GGCATCCGGG TTCTGCGCGA CCCATACACC AATAAGCCAA AGGTGGGTTT TTATACCACC AAGCGCGTTG GCGGCGGTGT GACCAACTTC GAAGCCATCA AGCTCCTGAA GATCGGCACA AGCTGA
|
Protein sequence | MSMRFNPAIA LGLVLLSALL FLLVLGGGAH AAAILPLDAH VTGFAMVMTV APQLNHRARG LVIGVRNEGD AQKIFEELKK SVEAFKASHE EELKGIKNKF ADVVTTEKVD KINNEITSLQ KALDDVNAMM AASKLGGAGD GVESADQREH RGAFNKWFRK GADAGLADLE VKAALTTQSD PDGGFLVPTQ TETTIDRVLG TVSTMRQLAT VMPVGTDTYT KFVNMGGSGA GWVGEEESRP ETGTPTLREI VLTVMELYAN PFTTQKMLDD GIIDIATWLA DEVSITFAEK EGAAFISGDG VKKPRGILAY DTIANASYAW GSLGFVVSGG ASGFASSAPA DAFIDLYYGL KAGYRTNASF LTSDATMGSI RKMKDGQGNY LWRDPSAPGE VPTILGKPVY TDDNMPAVAA NAFPVAFGDF KRGYMVADRT GIRVLRDPYT NKPKVGFYTT KRVGGGVTNF EAIKLLKIGT S
|
| |