Gene Avi_1338 details


Gene Information

Locus tag: Avi_1338
Symbol:
ID: 7389083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 1122759
End bp: 1124021
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 62%
IMG OID: 643650754
Product: phage major capsid protein HK97 family
Protein accession: YP_002548960
Protein GI: 222148003
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
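
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1122759
END_BP = 1124021
REPORTED_GENE_LENGTH = 1263      # bp
REPORTED_PROTEIN_LENGTH = 420    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1   # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length == REPORTED_GENE_LENGTH)                       # True
print(protein_length(length) == REPORTED_PROTEIN_LENGTH)    # True
```

Under those assumptions the reported gene length (1263 bp) and protein length (420 aa) agree with the coordinates.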


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0373474
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGATC ATCAGATGAC TGCCCCCGAA ATCAAGGCGA TCCCGGAAAC CATGGCCGCC 
GCCTTCGACG ACTTTATGGA AGCCTTCGAA GGCTTCAAGC AGGCCAATGA CCAGCGGCTC
GGCGAAATCG AGCAGAAACT GACCGCCGAT GTCGTCACCC GCGACAAGGT GGAGCGGATC
AACAAGGCGA TGGACGAACA GTCCCGGCTG CTGGACCAGC TGGCGCTGAA AAAGCTGCGC
CCGGCGCTGG GATCGAACGG CAATCGTGGC GGTTCCGGCA GCCTGGAGGC GACAGAGCGC
AAGGCGGCCT TCGAGGCCTA TATCCGGCGC GGCGATGAAA CGGCGCTGCG CGATCTCGAT
GCCAAATCCA TGGCGATTGG TTCGGATGCG GATGGCGGAT ATCTGGTGAC GGATGAGACC
GATAGTGAGA TTGGCCGCAG GCTCGCCTCC ATCTCGCCGA TCCGGCAATT GGCCAGCGTC
CGGCAGGTGT CGGGCGCGGT GCTGAAAAAG CCGTTTGCCC CAAGCGGCAT GGCCTCCGGC
TGGGTGGCGG AAACCGCGGC CCGCACCCAG ACCGATACGC CGCAGCTGAC GGAACTTTCC
TTTCCGACCA TGGAGATTTA CGCCATGCCC GCCGCCACCC AGTCGCTGCT GGATGATGCT
GCTGTTGATG TCGAAGCCTG GATTGCCGGA GAGGTGGATA TTGCCTTTGC CGAGCAGGAA
GGGGCAGCCT TTGTGGCGGG CGATGGTGTT AACAAGCCGA AGGGCTTCCT CGCCTATGAG
ACGGTGGCCG ACACCGCCTG GGCCTGGGGC AAGATCGGCT TCAAGGCGAC CGGCGCTGCC
GGTGGCTTTG CCGCAAGCGG GCCATCCGAC GTATTGCTCG ACACCATCTA CGCGCTGAAG
GCCGGGCATC GCCAGAACGG CACGTTTGTG ATGAACCGCA AGACCCAGGG CGAAATCCGC
AAGTTCAAGG ATGCCGACGG CAATTATCTC TGGCTGCCGC CCGCAGGCCC TGGCCTTGAA
GCCTCGTTGA TGGGCTTTCC GATTGCCGAG GCTGAGGACA TGCCCGATAT CGCCGCCAAC
GCCTTTTCCA TCGCCTTTGG CGATTTCAAG GCCGGGTATC TGGTGGTTGA CCGCATGGGT
GTGCGGGTGC TGCGCGATCC CTATTCGGCC AAGCCCTATG TGCTGTTCTA CACCACCAAA
CGGGTTGGTG GTGGAATGCA GAACTTTGAA GCGCTCAAGC TGATCAAGTT TGCTGCCAGT
TAA
 
Protein sequence
MSDHQMTAPE IKAIPETMAA AFDDFMEAFE GFKQANDQRL GEIEQKLTAD VVTRDKVERI 
NKAMDEQSRL LDQLALKKLR PALGSNGNRG GSGSLEATER KAAFEAYIRR GDETALRDLD
AKSMAIGSDA DGGYLVTDET DSEIGRRLAS ISPIRQLASV RQVSGAVLKK PFAPSGMASG
WVAETAARTQ TDTPQLTELS FPTMEIYAMP AATQSLLDDA AVDVEAWIAG EVDIAFAEQE
GAAFVAGDGV NKPKGFLAYE TVADTAWAWG KIGFKATGAA GGFAASGPSD VLLDTIYALK
AGHRQNGTFV MNRKTQGEIR KFKDADGNYL WLPPAGPGLE ASLMGFPIAE AEDMPDIAAN
AFSIAFGDFK AGYLVVDRMG VRVLRDPYSA KPYVLFYTTK RVGGGMQNFE ALKLIKFAAS
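
The gene and protein sequences can be checked against each other, and against the reported GC content, with a few lines of code. This is a minimal sketch, not the database's own method: the helper names are hypothetical, and it assumes translation table 11 uses the standard codon-to-amino-acid assignments for elongation. Only the first 30 nucleotides are translated here; pasting the full gene sequence into `translate` and `gc_content` would extend the check to the whole record.

```python
# Standard codon assignments, built from the compact TCAG-ordered layout.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
first_groups = "ATGAGCGATC ATCAGATGAC TGCCCCCGAA"
print(translate(first_groups))  # MSDHQMTAPE
```

The translated fragment matches the first ten residues of the protein sequence listed in the record.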