Gene Avi_1970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_1970 
Symbol 
ID7387256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp1625195 
End bp1626472 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content60% 
IMG OID643651219 
Productmajor capsid-like protein 
Protein accessionYP_002549415 
Protein GI222148458 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.244233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTATGT CGCAAAGACT GAAAGAATTG CGCGAAAAGC AGGCGCGCCT TGTTACCGAG 
GCCCGCGAAC GGCTGGAGGC GATTACGCCT GACACCGATG AGGTTCGGTC CAAGGAACTG
GAAAGCGCTC ATGATACGGC GATGGCCGAA TATGATCGGC TGCAAGGTAT CCTCGACCGG
GAAGAAAAAC TGATCGAGCT GGAAAAGCGC GATGAAGAGC GCCGCGCCAA ACAGCGCCCG
TTGCGCGATA CCCCGGAAAT GCGTGGCGAC GATCTACCCC AGGGTGGCAA GGTCGAATAT
CGCAGCGTGT TTGCCAAGGT GGTCTGCGGT GTCAGTCCGT CCGATCTGAC CAAGGAAGAG
CGCGCGGTGC TGCAGCGTGG CGCTGCCTCG TTTGAGAGCC GCGATCAGGT GACAACCAGT
GCCGTTGCGG GTGGCTATAC CGTGCCGGTC GAGCTGGAAG CCACGATCAT CCAGGCGATG
AAAGCCTGGG GGCCGCTTTA TGACGAAAAC ATCTGCACAG TGATCACCAC GTCGAAGGGC
AACGAAATGC TGGTGCCGAC CGTTGACGAT ACCGACAACG AGGCCGATCC CCTGGCCGAG
GCGGCCGATC TCCTGGAAGA TGGCAGCGGC GATGTCGAGT TCGGGCAAAA GGTGCTGAAC
GCCTATGTCT ATGCCACGCC GTTCATCAAG TGGTCCTTCG AGCTGGATGC GGATTCGCTG
TTCAACATGG AATCGCTGCT CGGCGGCATG ATCGGTGAGC GCCTCGGTCG GATCGGGAAC
CGCAAGCTGA CATCGGGAAC CGGCCAAAAC CAGCCGAACG GCGTGGTGAA TGCGGCGGGC
CTCGGCGTGA CCACGGCGGC CAAGGATGCC TTCACCTTCG ACAACATCCT CGAACTGGAG
CATTCCATTG ATCCTGCTTA TCGTGGCTCG CCCAAATGCC GGTACATGTT CCATGACAAG
TTCCTGCTGG CCACCCGCAA ACTGAAGGAT GGCAACGGCA ATTACCTTTG GCAGCAGGGG
GATGTGCAGA AGGGAACACC GGCCAGCTTC AACGGTCGCG CCTATTCGAT CAATCAGCAC
ATGGATGAAG TGACGGCCGA CAAGCGGATC GCGCTGTTTG GCGACTTCTC CAAATACTAC
GTCCGCAAGG TCGGGTCGCC CGTCATCGGC GTGCTGCGCG AACGCTTCTG GCCGAAGGTC
GGCATTGCCG GCCTGATCCG CTTCGACGGC GAGCTGGGCG ATGCCAACGC CATCAAGGCC
ATGAAGACGG CGGCCTGA
 
Protein sequence
MTMSQRLKEL REKQARLVTE ARERLEAITP DTDEVRSKEL ESAHDTAMAE YDRLQGILDR 
EEKLIELEKR DEERRAKQRP LRDTPEMRGD DLPQGGKVEY RSVFAKVVCG VSPSDLTKEE
RAVLQRGAAS FESRDQVTTS AVAGGYTVPV ELEATIIQAM KAWGPLYDEN ICTVITTSKG
NEMLVPTVDD TDNEADPLAE AADLLEDGSG DVEFGQKVLN AYVYATPFIK WSFELDADSL
FNMESLLGGM IGERLGRIGN RKLTSGTGQN QPNGVVNAAG LGVTTAAKDA FTFDNILELE
HSIDPAYRGS PKCRYMFHDK FLLATRKLKD GNGNYLWQQG DVQKGTPASF NGRAYSINQH
MDEVTADKRI ALFGDFSKYY VRKVGSPVIG VLRERFWPKV GIAGLIRFDG ELGDANAIKA
MKTAA