Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5112 |
Symbol | |
ID | 7380917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 100041 |
End bp | 101228 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643648770 |
Product | non-heme haloperoxidase |
Protein accession | YP_002547007 |
Protein GI | 222106216 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACAACC GGAAAAGTCG CCGCGTTCGC GTTAATCATC GCGACGCGGT TGTCGCCGAG ACCCTCTCCA TTATTTCAAT AATCGCAATT AAATCTCAAT TAATAATGCA ATTGTTGGCA AGGGTGTTTG GCGGCAATAT GCCCTCATCA GACAAACCGA GTGGTCTGAC GACACAGAAA CAAACAAGAC CGCAGGACGC GAACATGACC GACACCACTC TTCTTTCCCT GACACGTCGT GGCGTCCTGC TTTCCGGATC GGCCGTCGCC GCAACCGCGC TCACCGCATC ATTCAGCTTT GCAGCCAAGC TCGCTGCGTC CACCAACACA TCAACCGAAG GAAACAAGAC CATGGGTACC ATCACCACCA AGGACGGCGT TGAAATCTTC TACAAGGATT GGGGCTCGAA GGATGCTCAG CCGATTGTAT TTCATCATGG CTGGCCGCTG TCATCGGATG ACTGGGACGC GCAGATGCTG TTCTTCCTCG CCAACGGTTA CCGCGTGATT GCCCATGACC GGCGCGGCCA CGGCCGTTCG GCGCAGGTTT CTGATGGCCA CGACATGGAC CATTATGCAG CTGATGCCTT CGCGGTCGTT GAAGCCCTCG ACCTCAAGAA CGCCGTCCAT ATCGGTCACT CCACCGGCGG AGGAGAAGTC GCCCGGTATG TGGCCATGCA TGGCCAGCCA GCCGGTCGCG TCGCCAAGGC GGTTCTGGTG TCTGCCGTCC CGCCGCTGAT GCTGAAGACT GACGCCAATC CCGAAGGCCT GCCGATGGAA GTTTTCGACG GTTTCCGCTC AGCGCTGGCT GCCAACCGCG CGCAGTTCTT CCGCGACGTT CCCGCCGGTC CATTCTATGG CTTCAACCGT GACGACGCCA AAGTCCAGGA GGGCGTGATC CAGAATTGGT GGCGTCAGGG TATGATGGGC GGCGCGAAGG CGCATTACGA CGGCATCAAG GCTTTCTCTG AAACCGACCA GACCGAGGAT CTGAAGACGA TAACCGTCCC AACTCTCGTC CTGCATGGCG AAGACGACCA GATCGTGCCC ATCGCCGATG CAGCACTGAA AGCTATCAAG CTGCTGAAGA ATGGTACGCT CAAGACCTAT CCCGGCTTCT CGCACGGTAT GCTGACCGTC AATGCCGATG TGCTCAACGC CGACCTGCTG GCTTTCGTGA AGTCCTGA
|
Protein sequence | MHNRKSRRVR VNHRDAVVAE TLSIISIIAI KSQLIMQLLA RVFGGNMPSS DKPSGLTTQK QTRPQDANMT DTTLLSLTRR GVLLSGSAVA ATALTASFSF AAKLAASTNT STEGNKTMGT ITTKDGVEIF YKDWGSKDAQ PIVFHHGWPL SSDDWDAQML FFLANGYRVI AHDRRGHGRS AQVSDGHDMD HYAADAFAVV EALDLKNAVH IGHSTGGGEV ARYVAMHGQP AGRVAKAVLV SAVPPLMLKT DANPEGLPME VFDGFRSALA ANRAQFFRDV PAGPFYGFNR DDAKVQEGVI QNWWRQGMMG GAKAHYDGIK AFSETDQTED LKTITVPTLV LHGEDDQIVP IADAALKAIK LLKNGTLKTY PGFSHGMLTV NADVLNADLL AFVKS
|
| |