Gene Avi_0441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_0441 
Symbol 
ID7388452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp375110 
End bp376147 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content60% 
IMG OID643650096 
Producthemolysin 
Protein accessionYP_002548311 
Protein GI222147354 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.327012 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATT TTTCGACACG ATCGGCCAGT GAGGCCACGA AGGAGCAGGA CGGCTCCTCC 
TCCGAAGAGG GGTCTAGTCC GCAGCGAAGT TCCGCGGCCC ACAAGCCGCA ATCATCTTTC
TGGGCGCGGG CCGCCCGGAT CTTGAAACCC GCCGGCGGCA ATCTGCGTGA GGATATCGCC
GACGCGCTGA TGTCCGACAG GGCCGCTGAG GAGGCCTTTT CCGCCGAAGA GCGGGCGATG
TTGCACAATA TCCTGCGCTT TCGCGAAGTG CGCGTCGAAG ATGTCATGGT GCCGCGCTCC
GATATCCATG CTGTCGATGT CGAGACCAGC ATTGGCGAAT TGATGACCCT GTTCCAGCAA
ACCGGCCATT CGCGCATGCC GGTCTATTGC GACACGCTGG ATGATCCACG CGGCATGGTG
CATATCCGCG ATCTTCTCTC CTATATCACC CTGAAGGCGC TGAACGGCAA CGGCCTCGAC
CTCGCCTGCG TCGATCTTGG CGTGACGCTG GAGGAGGCTG GCATCATCCG CTCCATCCTG
TTCGTGCCAC CGTCCATGCA AGCCTCCGAC CTTCTGGCCC GTATGCAGGC TGCCCGCACC
CAGATGGCGC TGGTGATCGA CGAATATGGC GGCACCGATG GCCTGGTTTC GCATGAAGAT
ATCGTCGAAA TGGTGGTTGG CGACATTGAA GACGAGCACG ACAAGGAAGA GGCGCTGGTC
ACCCGCGTCT CACAGGATGT CTATCTGGCC GATGCCCGCA TCGAACTTGA GGAAATCGCC
GAGGTGATCG GTCCTGATTT CGATATCAGT GCGGAAATCG ATGAAGTCGA TACTCTGGGC
GGATTGCTCT CCACCGCGAT TGGCCGGGTG CCGCAGCGTG GTGAAGTGGT GCAGGCGGTG
GCGGGCTTCG AGCTACATAT TCTCGATGCC GATCCGCGAA GAGTGAAGAA GGTCCGTATC
ACCCGCATGG CACCAATTGC CAAGCGCCTG CAGGAAGGCG CCGATTTGCA GACAGTCGGA
TCCGGGCAGG GTAAGTAG
 
Protein sequence
MSDFSTRSAS EATKEQDGSS SEEGSSPQRS SAAHKPQSSF WARAARILKP AGGNLREDIA 
DALMSDRAAE EAFSAEERAM LHNILRFREV RVEDVMVPRS DIHAVDVETS IGELMTLFQQ
TGHSRMPVYC DTLDDPRGMV HIRDLLSYIT LKALNGNGLD LACVDLGVTL EEAGIIRSIL
FVPPSMQASD LLARMQAART QMALVIDEYG GTDGLVSHED IVEMVVGDIE DEHDKEEALV
TRVSQDVYLA DARIELEEIA EVIGPDFDIS AEIDEVDTLG GLLSTAIGRV PQRGEVVQAV
AGFELHILDA DPRRVKKVRI TRMAPIAKRL QEGADLQTVG SGQGK