Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_18440 |
Symbol | |
ID | 7760778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1825049 |
End bp | 1826548 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643804742 |
Product | Aldehyde dehydrogenase family protein |
Protein accession | YP_002799031 |
Protein GI | 226943958 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.931316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGCGT CCAACGATAC CCGTTCGAAC TACGCTCCCG ACAGCAGCTA CGGCCTGTTC ATCGACAATC AGTGGGTTGC AGGTGAAAAC GGCGAAACCA TCACCATCCT CAATCCCGCC AACGGGAAAA CCCTCACCGG CATCCCGAAC GCCACGGCGG TCGACGTCGA CCGTGCAGTA CAGGCCGCGC AACGCGCTTT CGAAGCCTGG CGCAGCACCA CGCCAATAGA ACGCGCCAAT GCGCTGCTGA AGATCGCCGA CTTGCTGGAA GCCGACGCCG AACGGTTCGC CGCCCTGGAA TCCCTCGATG TAGGCAAGCC AATCCGTGAG AGCAGTTCCG TCGACATCCC GTTGGCGATC GATCACTTCC GCTATTTCGC CGGCGTGATC CGCAGCCACT CGGACGAGGC AGTCATGCTG GATGAACAGA CGCTCAGCAT CGTGCTCAGC GAGCCGCTGG GCGTCGTCGG CCAGGTGATC CCTTGGAACT TCCCACTGCT GATGGCCGCC TGGAAAATCG CCCCGGCCAT CGCGATAGGA AATACCGTCG TCATCAAACC TTCCGAACTG ACCCCGGTCA GCATCCTCGA ACTCGCAAAG ATCTTCGCCC AGGTATTGCC GGCCGGGGTT GTGAACATCG TCACCGGTAC GGGTGCCTCG GCGGGCCAGG CGCTGCTGGA CCATCCGGAC GTGCGCAAGC TTGCCTTCAC CGGCTCGACA AGTGTCGGCC ATCGGGTGGC CGACGCGGCG GCGAAGAAGC TCATTCCGGC GACCCTCGAG CTGGGCGGCA AGTCGGCCAA TATCGTCTTC CCCGATGCCA ACTGGGACAA GGCCGTGGAA GGCGCAGCAC TCGCCATTCT GTGGAACCAG GGCCAAGTCT GCGAATCCGG CGCTCGGCTG TTCGTGCACG AGTCGATCTA CGAGCGCTTC CTGGATGAGG TCAAGCAGAA ATTCGAAGCC GTGCGCGTGG GCGACCCGCT GCATCCGGAC ACCATGATGG GTGCCCAGGT CAGCAAGACA CAGATGGAGC GAATCCTTGG CTACGTCGAT ATCGCCAAGC AGGAAGGTGC CAAGGTACTG CTAGGCGGTG GTCGTCTGAC AGGTGCCGAT TACGATGCGG GCTTCTTCAT CCAGCCAACA ATCCTGGTCG ACGTACGCAA CGACATGCGT GTGGCCTACG AGGAAATCTT CGGTCCCGTT CTATGCGTGA TTCCGTTCAA GGACGAAGCG GACGTCATTG CCATGGCCAA CGATTCGGAA TACGGCCTGG CGGGCGCGGT CTGGACCCAG GACATCAACC GGGCGCTGCG CGTGGCACGC GCGGTGGAAA CCGGGCGGAT GTGGGTCAAC ACCTATCACG AAATCCCCGC CCACGCGCCC TTCGGCGGCT ACAAGAAATC CGGCCTGGGA CGGGAAACCC ACAAGTCGAT TCTGGAAGCC TACAGCCAGA AGAAGAACAT CTATGTCAGC CTCAACGAAG CGCCGCTCGG GTTGTTCTGA
|
Protein sequence | MQASNDTRSN YAPDSSYGLF IDNQWVAGEN GETITILNPA NGKTLTGIPN ATAVDVDRAV QAAQRAFEAW RSTTPIERAN ALLKIADLLE ADAERFAALE SLDVGKPIRE SSSVDIPLAI DHFRYFAGVI RSHSDEAVML DEQTLSIVLS EPLGVVGQVI PWNFPLLMAA WKIAPAIAIG NTVVIKPSEL TPVSILELAK IFAQVLPAGV VNIVTGTGAS AGQALLDHPD VRKLAFTGST SVGHRVADAA AKKLIPATLE LGGKSANIVF PDANWDKAVE GAALAILWNQ GQVCESGARL FVHESIYERF LDEVKQKFEA VRVGDPLHPD TMMGAQVSKT QMERILGYVD IAKQEGAKVL LGGGRLTGAD YDAGFFIQPT ILVDVRNDMR VAYEEIFGPV LCVIPFKDEA DVIAMANDSE YGLAGAVWTQ DINRALRVAR AVETGRMWVN TYHEIPAHAP FGGYKKSGLG RETHKSILEA YSQKKNIYVS LNEAPLGLF
|
| |