Gene Avi_3005 details


Gene Information

Locus tag: Avi_3005
Symbol:
ID: 7386164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 2511068
End bp: 2512363
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 62%
IMG OID: 643651992
Product: prophage MuSo2 F protein
Protein accession: YP_002550176
Protein GI: 222149219
COG category: [S] Function unknown
COG ID: [COG2369] Uncharacterized protein, homolog of phage Mu protein gp30
TIGRFAM ID: [TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family
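
The coordinates and lengths in this record agree with each other, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The record does not state either convention, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2511068
end_bp = 2512363

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1296
print(protein_length_aa)  # 431
```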


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.680604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGCGA CGATCGCAGC GCTGAAGCCG GACGATGCCA TCAAGGCTCT GAAGGCGCGC 
GGCGAACAGC TCGCGCCGTC CTTTTCCTGG CAGGACGTTT ATGCCGAGGA GCATGCCAAA
CAGTTCACGG TCGCCAAGTC GGCCGGTTTC GATATCCTCA CAGACCTTTT CGACGGGCTT
CAAACCAGCC TTGAAGAGGG CAAGACGTTT CGGGATTTCG CCAGCCAGGT GACGCCGGTT
CTCCAGGCTA AGGGCTGGTG GGGCGTCCAG GACGTGACCG ATCCAGTGAC AGGGGAGCTT
CGCAAAGCGC AGCTCGGCTC GACCCGTCGC CTGCAATTGA TCTTCGACGT GAACCTTCGC
GTCTCTTATG CGGCGGGTCA TTGGGCGGCC TTTGAGCGCA ACAAGGCGCG CCGGCCTTGG
CTGCGCTATG TCTGTATTCT GGACGACCAC ACCCGGCCGG AACACCGCAA GCGCCACAAT
CTTTGCCTGC CCGTCGATCA TCCCTATTGG GACACATGGG CACCGCCTTG TGGCTGGAAT
TGCCGCTGCA CGCTGCAAAG CCTGTCGGAT CGGGATGTCG AGCGGATGCG GGGTGAGTTG
AAGTTCACGC CGCCCGAAGA TGACTTCGTT GCCTTCACCA ACAAGCGCAC TGGCGAAGTC
CGGATGATCC CGCGCGGCAT CGATCCCGGT TGGGACCACA ATCCCGGCAA GGCTGGCTTT
CGGGCCTTCG ATGCGGCGGA AAAGCTGATC AATGCACCGC CGATCATGGC CGCCCAGGTC
AACAAAGATC CGGACTGGCT GGTCAAGCCG CTCGGCGATG ACTTTGCGAG GTGGTTTGAT
GCGGCCACAG CGGGCGGGCG CGTGGACCGG TCCATCATCG TGGTTGGCGC TTTGTCCGAG
GATGTCCTGG CATCTCTTGC CCAGGGCGGG ATTGCGCCGC AGTCAGGCGC GATCACCCTG
ACCCAGCAAG CTGCTCTGCA TATGATCCGC GATGCCAAGG CCGGAGTGGG AAAGACCGTC
GATATGGCGG CGCTTCGGCA ACTGCCGGCC AATCTCAGCC GGCCGAGGGC GGTCCTGCGG
GATAAGCGCG ACGGTGCGCT TCTCTATGTG TTCGACAGCG GCCAAGACCC GCGTCTGGCC
AAGATCGTTG TGAAGGTCGA TTTCGCAGAT AAGGCCCGGC CACCAGGGGG AAAAGCCCAG
ACGATCGTCA CCAATTCGAT CCGAACTGCG GGGCTGGTTG AAGCCCGCGT CCTGACAGAC
GAGAAGACTT ACGAGCTTAT CAGTGGGACG ATTTAG
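
The GC content reported for this gene can be recomputed from the sequence as the fraction of bases that are G or C. The sketch below is one way to do that, assuming the sequence is pasted as text with the space and line-break grouping used in this record. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed value describes that line and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGACGGCGA CGATCGCAGC GCTGAAGCCG GACGATGCCA TCAAGGCTCT GAAGGCGCGC"
print(f"{gc_content(first_line):.0%}")
```

Passing the complete 1296 bp sequence to `gc_content` gives the value to compare against the reported GC content.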
 
Protein sequence
MTATIAALKP DDAIKALKAR GEQLAPSFSW QDVYAEEHAK QFTVAKSAGF DILTDLFDGL 
QTSLEEGKTF RDFASQVTPV LQAKGWWGVQ DVTDPVTGEL RKAQLGSTRR LQLIFDVNLR
VSYAAGHWAA FERNKARRPW LRYVCILDDH TRPEHRKRHN LCLPVDHPYW DTWAPPCGWN
CRCTLQSLSD RDVERMRGEL KFTPPEDDFV AFTNKRTGEV RMIPRGIDPG WDHNPGKAGF
RAFDAAEKLI NAPPIMAAQV NKDPDWLVKP LGDDFARWFD AATAGGRVDR SIIVVGALSE
DVLASLAQGG IAPQSGAITL TQQAALHMIR DAKAGVGKTV DMAALRQLPA NLSRPRAVLR
DKRDGALLYV FDSGQDPRLA KIVVKVDFAD KARPPGGKAQ TIVTNSIRTA GLVEARVLTD
EKTYELISGT I
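
The protein sequence can be checked against the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may start translation. This gene starts with ATG, so the sketch below simply applies the standard assignments and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 bases and the last line of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in the conventional T, C, A, G order;
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGACGGCGA CGATCGCAGC GCTGAAGCCG"))          # MTATIAALKP
print(translate("GAGAAGACTT ACGAGCTTAT CAGTGGGACG ATTTAG"))   # EKTYELISGTI
```

Both outputs match the corresponding ends of the protein sequence listed above. The final line can be translated on its own because it begins at a codon boundary (1260 bases precede it).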