Gene Information |
Locus tag | Avi_1338 |
Symbol | |
ID | 7389083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 1122759 |
End bp | 1124021 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643650754 |
Product | phage major capsid protein HK97 family |
Protein accession | YP_002548960 |
Protein GI | 222148003 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
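The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The helper names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Avi_1338.
length_bp = gene_length(1122759, 1124021)
print(length_bp)                  # 1263
print(protein_length(length_bp))  # 420
```

Both results agree with the Gene Length (1263 bp) and Protein Length (420 aa) fields listed in the record.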
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0373474 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATC ATCAGATGAC TGCCCCCGAA ATCAAGGCGA TCCCGGAAAC CATGGCCGCC GCCTTCGACG ACTTTATGGA AGCCTTCGAA GGCTTCAAGC AGGCCAATGA CCAGCGGCTC GGCGAAATCG AGCAGAAACT GACCGCCGAT GTCGTCACCC GCGACAAGGT GGAGCGGATC AACAAGGCGA TGGACGAACA GTCCCGGCTG CTGGACCAGC TGGCGCTGAA AAAGCTGCGC CCGGCGCTGG GATCGAACGG CAATCGTGGC GGTTCCGGCA GCCTGGAGGC GACAGAGCGC AAGGCGGCCT TCGAGGCCTA TATCCGGCGC GGCGATGAAA CGGCGCTGCG CGATCTCGAT GCCAAATCCA TGGCGATTGG TTCGGATGCG GATGGCGGAT ATCTGGTGAC GGATGAGACC GATAGTGAGA TTGGCCGCAG GCTCGCCTCC ATCTCGCCGA TCCGGCAATT GGCCAGCGTC CGGCAGGTGT CGGGCGCGGT GCTGAAAAAG CCGTTTGCCC CAAGCGGCAT GGCCTCCGGC TGGGTGGCGG AAACCGCGGC CCGCACCCAG ACCGATACGC CGCAGCTGAC GGAACTTTCC TTTCCGACCA TGGAGATTTA CGCCATGCCC GCCGCCACCC AGTCGCTGCT GGATGATGCT GCTGTTGATG TCGAAGCCTG GATTGCCGGA GAGGTGGATA TTGCCTTTGC CGAGCAGGAA GGGGCAGCCT TTGTGGCGGG CGATGGTGTT AACAAGCCGA AGGGCTTCCT CGCCTATGAG ACGGTGGCCG ACACCGCCTG GGCCTGGGGC AAGATCGGCT TCAAGGCGAC CGGCGCTGCC GGTGGCTTTG CCGCAAGCGG GCCATCCGAC GTATTGCTCG ACACCATCTA CGCGCTGAAG GCCGGGCATC GCCAGAACGG CACGTTTGTG ATGAACCGCA AGACCCAGGG CGAAATCCGC AAGTTCAAGG ATGCCGACGG CAATTATCTC TGGCTGCCGC CCGCAGGCCC TGGCCTTGAA GCCTCGTTGA TGGGCTTTCC GATTGCCGAG GCTGAGGACA TGCCCGATAT CGCCGCCAAC GCCTTTTCCA TCGCCTTTGG CGATTTCAAG GCCGGGTATC TGGTGGTTGA CCGCATGGGT GTGCGGGTGC TGCGCGATCC CTATTCGGCC AAGCCCTATG TGCTGTTCTA CACCACCAAA CGGGTTGGTG GTGGAATGCA GAACTTTGAA GCGCTCAAGC TGATCAAGTT TGCTGCCAGT TAA
|
Protein sequence | MSDHQMTAPE IKAIPETMAA AFDDFMEAFE GFKQANDQRL GEIEQKLTAD VVTRDKVERI NKAMDEQSRL LDQLALKKLR PALGSNGNRG GSGSLEATER KAAFEAYIRR GDETALRDLD AKSMAIGSDA DGGYLVTDET DSEIGRRLAS ISPIRQLASV RQVSGAVLKK PFAPSGMASG WVAETAARTQ TDTPQLTELS FPTMEIYAMP AATQSLLDDA AVDVEAWIAG EVDIAFAEQE GAAFVAGDGV NKPKGFLAYE TVADTAWAWG KIGFKATGAA GGFAASGPSD VLLDTIYALK AGHRQNGTFV MNRKTQGEIR KFKDADGNYL WLPPAGPGLE ASLMGFPIAE AEDMPDIAAN AFSIAFGDFK AGYLVVDRMG VRVLRDPYSA KPYVLFYTTK RVGGGMQNFE ALKLIKFAAS