Gene Avin_10720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_10720 
Symbol 
ID7760016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1018121 
End bp1019626 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content70% 
IMG OID643803976 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_002798278 
Protein GI226943205 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.593646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCACT CCATCCCCAG CATCAAACTG CTGATCGACG GCCAGTTCGT CGAATCCACC 
ACCAGCCAAT GGCGCGAGGT GGTCGACCCG GCCACCCAGC AGGTCCTGGC CCGCGTCCCC
TTCGCCAGCG AGGCGGAGCT GAACGCGGCG GTGGCCAGCG CCGCCGCGGC GTTCAAGACC
TGGCGCAAGA CCTCCATCGG TACCCGCGCG CGGCTCTTCC TCAAGTACCA GCAACTGATC
CGGGAGAACC TCAAGGAACT GGCGGCGATC CTCAGCGCCG AGCAGGGCAA GACCCTGGCC
GACGCCGAGG GCGACGTGTT CCGCGGCCTG GAGGTGGTCG AGCACGCCGC CGGCATCGGC
AACCTGCAGC TCGGCGAACT GGCCAACAAC GTCGCCGGCG GCGTCGACAC CTACACCCTG
CTGCAGCCGC TCGGCGTCTG CGCCGGCATC ACCCCCTTCA ACTTTCCGGC GATGATCCCG
CTGTGGATGT TCCCGATGGC CATCGCCACC GGCAACACCT TCGTCCTCAA GCCCTCCGAG
CAGGACCCGA TGGTCACCAT GCGCCTGGTC GAGCTGGCCT TGGAAGCCGG CGTGCCGCCG
GGAGTGCTCA ACGTGGTCCA CGGCGGCGCC GAGGTGGTCG ACCGGCTCTG CGACCACCCG
GACATCAAGG CACTCTCCTT CGTCGGATCG AGTCGCGTCG GCGCCCACGT CTACCAGCGC
GCCAGCCAGG CCGGCAAGCG CGTGCAGTGC ATGATGGGCG CGAAGAACCA CGCCGTCGTC
CTGCCCGACG CGCACAAGGA ACAGACCCTC AACAGCCTGG CCGGTGCCGC CTTCGGCGCG
GCCGGCCAGC GCTGCATGGC GATCTCGGTG GCGGTCCTGG TCGGCGCGGC CCGCGACTGG
CTGCCGGAGC TGGTGGCCAA GGCCGCCACC CTCAAGGTCG GCGCCGGCAG CAAGCCCGGC
ACCGACCTCG GCCCACTGAT CTCGCGCGCC GCCCTGGACC GGGTCGGCAG CCTGATCGAG
CAGGGCGTGC GCGAAGGCGC GCGGCTGGAG CTGGACGGCC GCAACCCGGT CGTCGCCGGC
TACGAGCAGG GCAATTTCGT CGGCCCGACC CTGTTCTCCG GCGTCACCCC TGGGATGAGC
CTGTACCGCG AGGAGATCTT CGGCCCGGTG CTCTGCGTGA TGCAGGCCGA GACCCTGGAC
GAGGCCATCG CCATCGTCAA CGCCAACCCC CACGGCAACG GCACCGCCCT GTTCACCCGC
TCCGGCGCCG CGGCCCGGCA CTTCCAGGAG GAGATCGAGG TCGGCCAGGT CGGCATCAAC
GTGCCGATCC CGGTGCCGGT GCCGATCTTC TCCTTCACCG GCTCGCGGGC CTCCAAGCTC
GGCGACCTGG GGCCGTACGG CAAACAGGTG GTGCAGTTCT ACACCCAGAC CAAGACCGTC
ACCCAGCGCT GGTTCGACGA GAACGAGGTC GGCGGCCCGG TCAACACCAC CATCACCCTC
AAGTGA
 
Protein sequence
MTHSIPSIKL LIDGQFVEST TSQWREVVDP ATQQVLARVP FASEAELNAA VASAAAAFKT 
WRKTSIGTRA RLFLKYQQLI RENLKELAAI LSAEQGKTLA DAEGDVFRGL EVVEHAAGIG
NLQLGELANN VAGGVDTYTL LQPLGVCAGI TPFNFPAMIP LWMFPMAIAT GNTFVLKPSE
QDPMVTMRLV ELALEAGVPP GVLNVVHGGA EVVDRLCDHP DIKALSFVGS SRVGAHVYQR
ASQAGKRVQC MMGAKNHAVV LPDAHKEQTL NSLAGAAFGA AGQRCMAISV AVLVGAARDW
LPELVAKAAT LKVGAGSKPG TDLGPLISRA ALDRVGSLIE QGVREGARLE LDGRNPVVAG
YEQGNFVGPT LFSGVTPGMS LYREEIFGPV LCVMQAETLD EAIAIVNANP HGNGTALFTR
SGAAARHFQE EIEVGQVGIN VPIPVPVPIF SFTGSRASKL GDLGPYGKQV VQFYTQTKTV
TQRWFDENEV GGPVNTTITL K