Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_1970 |
Symbol | |
ID | 7387256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 1625195 |
End bp | 1626472 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643651219 |
Product | major capsid-like protein |
Protein accession | YP_002549415 |
Protein GI | 222148458 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.244233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTATGT CGCAAAGACT GAAAGAATTG CGCGAAAAGC AGGCGCGCCT TGTTACCGAG GCCCGCGAAC GGCTGGAGGC GATTACGCCT GACACCGATG AGGTTCGGTC CAAGGAACTG GAAAGCGCTC ATGATACGGC GATGGCCGAA TATGATCGGC TGCAAGGTAT CCTCGACCGG GAAGAAAAAC TGATCGAGCT GGAAAAGCGC GATGAAGAGC GCCGCGCCAA ACAGCGCCCG TTGCGCGATA CCCCGGAAAT GCGTGGCGAC GATCTACCCC AGGGTGGCAA GGTCGAATAT CGCAGCGTGT TTGCCAAGGT GGTCTGCGGT GTCAGTCCGT CCGATCTGAC CAAGGAAGAG CGCGCGGTGC TGCAGCGTGG CGCTGCCTCG TTTGAGAGCC GCGATCAGGT GACAACCAGT GCCGTTGCGG GTGGCTATAC CGTGCCGGTC GAGCTGGAAG CCACGATCAT CCAGGCGATG AAAGCCTGGG GGCCGCTTTA TGACGAAAAC ATCTGCACAG TGATCACCAC GTCGAAGGGC AACGAAATGC TGGTGCCGAC CGTTGACGAT ACCGACAACG AGGCCGATCC CCTGGCCGAG GCGGCCGATC TCCTGGAAGA TGGCAGCGGC GATGTCGAGT TCGGGCAAAA GGTGCTGAAC GCCTATGTCT ATGCCACGCC GTTCATCAAG TGGTCCTTCG AGCTGGATGC GGATTCGCTG TTCAACATGG AATCGCTGCT CGGCGGCATG ATCGGTGAGC GCCTCGGTCG GATCGGGAAC CGCAAGCTGA CATCGGGAAC CGGCCAAAAC CAGCCGAACG GCGTGGTGAA TGCGGCGGGC CTCGGCGTGA CCACGGCGGC CAAGGATGCC TTCACCTTCG ACAACATCCT CGAACTGGAG CATTCCATTG ATCCTGCTTA TCGTGGCTCG CCCAAATGCC GGTACATGTT CCATGACAAG TTCCTGCTGG CCACCCGCAA ACTGAAGGAT GGCAACGGCA ATTACCTTTG GCAGCAGGGG GATGTGCAGA AGGGAACACC GGCCAGCTTC AACGGTCGCG CCTATTCGAT CAATCAGCAC ATGGATGAAG TGACGGCCGA CAAGCGGATC GCGCTGTTTG GCGACTTCTC CAAATACTAC GTCCGCAAGG TCGGGTCGCC CGTCATCGGC GTGCTGCGCG AACGCTTCTG GCCGAAGGTC GGCATTGCCG GCCTGATCCG CTTCGACGGC GAGCTGGGCG ATGCCAACGC CATCAAGGCC ATGAAGACGG CGGCCTGA
|
Protein sequence | MTMSQRLKEL REKQARLVTE ARERLEAITP DTDEVRSKEL ESAHDTAMAE YDRLQGILDR EEKLIELEKR DEERRAKQRP LRDTPEMRGD DLPQGGKVEY RSVFAKVVCG VSPSDLTKEE RAVLQRGAAS FESRDQVTTS AVAGGYTVPV ELEATIIQAM KAWGPLYDEN ICTVITTSKG NEMLVPTVDD TDNEADPLAE AADLLEDGSG DVEFGQKVLN AYVYATPFIK WSFELDADSL FNMESLLGGM IGERLGRIGN RKLTSGTGQN QPNGVVNAAG LGVTTAAKDA FTFDNILELE HSIDPAYRGS PKCRYMFHDK FLLATRKLKD GNGNYLWQQG DVQKGTPASF NGRAYSINQH MDEVTADKRI ALFGDFSKYY VRKVGSPVIG VLRERFWPKV GIAGLIRFDG ELGDANAIKA MKTAA
|
| |