Gene Information |
Locus tag | Avi_5226 |
Symbol | |
ID | 7381354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 225377 |
End bp | 226816 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643648866 |
Product | aldehyde dehydrogenase |
Protein accession | YP_002547103 |
Protein GI | 222106312 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
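The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(225377, 226816)
print(length, protein_length(length))  # 1440 479
```

Under these assumptions the computed values agree with the Gene Length (1440 bp) and Protein Length (479 aa) fields of the record.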
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTCGATA TGGCCATTGA AACTCTCCAG TCTATTCTGG AACGCCAGCG ACAATCCTTC CAGCATGACA GTTATCCGTC ACTGTCCTTG CGTCGGGACC GCCTGAACCG GATCGGCCGA TTGCTGAAGG AGAACCGGCA AGCGCTGTGC GATGTGGTTT CTCGGGATTT TGGTCATAGA TCCGATCATG AAACGGTGCA ATTGGAGATT GCTCCGCTCA TGAGCGCGCT TCGCCATACA CGGTCTCATC TACGCCGATG GATGAAACGC GAACGCCGTG GACGCTCGAT TGAGTTTCTG CAGCTTGCAA ATTGGGTCCA ATACCAGCCG CTCGGGGTGA TTGGTATTAT GGTGCCGTGG AACTATCCGC TTCTACTGGC ACTCGGACCG CTGATTGACA TTTTGGCTGC GGGAAACCGG GCGATCATCA AACCGTCGGA ATTGCTGCCG GAAACCTCGG CTCTTCTTTC AAAGCTCGTC GAGGCTTATT TCTCCCCGGA GGAGGTTGCC GTTATCGAAG GAGGGGTGGA GATTGCCGCA GCCTTTTCTG CCCTGCCGTT CGATCATCTG ATTTTCACCG GCTCAACGGC AGTCGGTCGC AAGGTTATGG CGTCAGCAGC TGCTAACCTG ACACCGCTCA CCCTGGAACT GGGTGGCAAG TCGCCCGCAC TCATCGCCCC GGATTATCCC ATTGCCGATG CAGCCCGTGA CATCGCCTTT GGAAAGCTGA TGAACGCGGG CCAGACTTGC ATTGCACCTG ACTATGTTCT GGTCGAGAAA TCAAAACTTG GAGACCTTGC ATCCGCCCTG ATCTCTCAGG CAGAGGCTTT TTATCCACGA CAGGCGGGAC CACAACATGC GGGGCCACAA CACGCAGGGC AAGAACAGTA TTCAAGCCTT GTCGGCGCTC GAGCGCATGA ACGGTTGCTG AAAGGCATTG AGGAATGCCG CGCCCGTGGG GCCAAACTCA TCACTGCTGA TATCGCCATG CCCTCTCAAG GACACGTGAT CGCACCCACG CTGGTAATCG ACCCGCCTGC GGACTGCTTG CTGATGGAGG AAGAAATCTT CGGGCCAATC CTGCCCCTTA TTCCCTATGA GGATTTTGAC ACAGCGTTGA AATTTGTCCG CGAGCGCCCG CGCCCTCTGG CGCTCTATAT CTTCACGGGA AACCGGGCGA CCGAGAAAAA AGCACTGTCG AACACGATTT CCGGCAATGT CACTATCAAC GGTACCCTTC TGCATATCGC TCAAAACGAC CTGCCCTTTG GCGGTATCGG GCCAAGCGGC ATGGGGGCCT ATCATGGTCA TGAGGGCTTC AAGCGTTTTT CGCACGCCCG CGGCATTGCG AAAGTCCGTC TTTTCAATCC CGCACGCCTC GCCATGCCTC CTTACGGATG GCTGGCACAA GTTCTGGCCA GGTTTATGAT GCGTGACTAA
|
Protein sequence | MFDMAIETLQ SILERQRQSF QHDSYPSLSL RRDRLNRIGR LLKENRQALC DVVSRDFGHR SDHETVQLEI APLMSALRHT RSHLRRWMKR ERRGRSIEFL QLANWVQYQP LGVIGIMVPW NYPLLLALGP LIDILAAGNR AIIKPSELLP ETSALLSKLV EAYFSPEEVA VIEGGVEIAA AFSALPFDHL IFTGSTAVGR KVMASAAANL TPLTLELGGK SPALIAPDYP IADAARDIAF GKLMNAGQTC IAPDYVLVEK SKLGDLASAL ISQAEAFYPR QAGPQHAGPQ HAGQEQYSSL VGARAHERLL KGIEECRARG AKLITADIAM PSQGHVIAPT LVIDPPADCL LMEEEIFGPI LPLIPYEDFD TALKFVRERP RPLALYIFTG NRATEKKALS NTISGNVTIN GTLLHIAQND LPFGGIGPSG MGAYHGHEGF KRFSHARGIA KVRLFNPARL AMPPYGWLAQ VLARFMMRD
|
| |
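The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC percentage calculation. The helper names are hypothetical, and the demo uses only the first two groups of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a space-grouped sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence, used as a short demo.
demo = clean_sequence("TTGTTCGATA TGGCCATTGA")
print(len(demo), gc_percent(demo))  # 20 40.0
```

Passing the full Gene sequence field through `clean_sequence` should give a string whose length can be compared with the Gene Length field.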