Gene Information |
Locus tag | Avi_0441 |
Symbol | |
ID | 7388452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 375110 |
End bp | 376147 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643650096 |
Product | hemolysin |
Protein accession | YP_002548311 |
Protein GI | 222147354 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
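The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a terminal stop codon that counts toward the gene length but is not translated; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the final codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1          # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp as listed for Avi_0441
print(cds_lengths(375110, 376147))  # (1038, 345)
```

The result matches the listed Gene Length (1038 bp) and Protein Length (345 aa).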
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.327012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATT TTTCGACACG ATCGGCCAGT GAGGCCACGA AGGAGCAGGA CGGCTCCTCC TCCGAAGAGG GGTCTAGTCC GCAGCGAAGT TCCGCGGCCC ACAAGCCGCA ATCATCTTTC TGGGCGCGGG CCGCCCGGAT CTTGAAACCC GCCGGCGGCA ATCTGCGTGA GGATATCGCC GACGCGCTGA TGTCCGACAG GGCCGCTGAG GAGGCCTTTT CCGCCGAAGA GCGGGCGATG TTGCACAATA TCCTGCGCTT TCGCGAAGTG CGCGTCGAAG ATGTCATGGT GCCGCGCTCC GATATCCATG CTGTCGATGT CGAGACCAGC ATTGGCGAAT TGATGACCCT GTTCCAGCAA ACCGGCCATT CGCGCATGCC GGTCTATTGC GACACGCTGG ATGATCCACG CGGCATGGTG CATATCCGCG ATCTTCTCTC CTATATCACC CTGAAGGCGC TGAACGGCAA CGGCCTCGAC CTCGCCTGCG TCGATCTTGG CGTGACGCTG GAGGAGGCTG GCATCATCCG CTCCATCCTG TTCGTGCCAC CGTCCATGCA AGCCTCCGAC CTTCTGGCCC GTATGCAGGC TGCCCGCACC CAGATGGCGC TGGTGATCGA CGAATATGGC GGCACCGATG GCCTGGTTTC GCATGAAGAT ATCGTCGAAA TGGTGGTTGG CGACATTGAA GACGAGCACG ACAAGGAAGA GGCGCTGGTC ACCCGCGTCT CACAGGATGT CTATCTGGCC GATGCCCGCA TCGAACTTGA GGAAATCGCC GAGGTGATCG GTCCTGATTT CGATATCAGT GCGGAAATCG ATGAAGTCGA TACTCTGGGC GGATTGCTCT CCACCGCGAT TGGCCGGGTG CCGCAGCGTG GTGAAGTGGT GCAGGCGGTG GCGGGCTTCG AGCTACATAT TCTCGATGCC GATCCGCGAA GAGTGAAGAA GGTCCGTATC ACCCGCATGG CACCAATTGC CAAGCGCCTG CAGGAAGGCG CCGATTTGCA GACAGTCGGA TCCGGGCAGG GTAAGTAG
|
Protein sequence | MSDFSTRSAS EATKEQDGSS SEEGSSPQRS SAAHKPQSSF WARAARILKP AGGNLREDIA DALMSDRAAE EAFSAEERAM LHNILRFREV RVEDVMVPRS DIHAVDVETS IGELMTLFQQ TGHSRMPVYC DTLDDPRGMV HIRDLLSYIT LKALNGNGLD LACVDLGVTL EEAGIIRSIL FVPPSMQASD LLARMQAART QMALVIDEYG GTDGLVSHED IVEMVVGDIE DEHDKEEALV TRVSQDVYLA DARIELEEIA EVIGPDFDIS AEIDEVDTLG GLLSTAIGRV PQRGEVVQAV AGFELHILDA DPRRVKKVRI TRMAPIAKRL QEGADLQTVG SGQGK
|
| |
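The sequences above are displayed in space-separated blocks of ten characters, so they need normalizing before any analysis. The following is a minimal sketch of how one might strip the spacing, compute GC content, and run a basic sanity check on a coding sequence. All function names are hypothetical, and the start-codon set is a simplification: only ATG, GTG and TTG are accepted, although translation table 11 permits a few rarer starts.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}      # stop codons in translation table 11
COMMON_STARTS = {"ATG", "GTG", "TTG"}    # simplification: common bacterial starts only


def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts and ends as a CDS should."""
    s = normalize(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in COMMON_STARTS
        and s[-3:] in STOP_CODONS
    )


# Toy input in the same ten-character block layout (not the full gene)
toy = "ATGAGCGATT TTTAG"
print(normalize(toy))
print(looks_like_complete_cds(toy))
print(gc_percent("GGCC AATT"))
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare against the GC content listed in the record.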