Gene Acry_3021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_3021 
Symbol 
ID5160676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp3303811 
End bp3305304 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content68% 
IMG OID640554951 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001236130 
Protein GI148262003 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.13552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAA TCGGTCATTT CATCAACAAC GAGACCGTCG CGGGCACGGG GGCGGAGCGT 
CTGCCCGTCT TCAACCCGGC GACCGGGGCG CAGACCGGCG CGGTGGCGGT GGCGACGGCG
GCGGAAATCG AGGCGGCGAT CGCCGCGGCG CGCGCCGCCT TCCCCGCCTG GGCCGCCACC
CCGCCGCTGC GCCGGGCGCG GATCATGTTC CGCTTCAAGG AACTCGTCGA GCAGCACAAG
GACGAGCTGG CGAAGGCGAT CACGCTGGAA CACGGCAAGG TCATTTCCGA TGCCGGCGGC
GAGGTCGTGC GCGGCCTTGA AGTGGTGGAA TTCGCCTGCG GCATTCCCGA ACTGCTGAAG
GGCGATTATT CGCGCAATGT CGGCACCAAC ATCGACAGCT ACTCGTTCCG CGCGCCGCTC
GGCGTCGTTG CCGGCGTCAC GCCGTTCAAC TTCCCGGTGA TGGTGCCGAT GTGGATGTTC
CCGATGGCGA TCGCCTGCGG CAACTGCTTC ATCCTCAAGC CCTCGGAAAA GGACCCCTCG
CCCGGCCTGT TGCTCGCCGA ACTGCTGAAG GAAGCCGGGC TGCCGGCAGG CGTGTTCAAC
GTGATCAACG GCGACCGCAC CGCCGTCGAC GCGCTGCTGA CAGACCCGCG CATCGCCGCC
GTCAGCTTCG TCGGCTCGAC GCCGATTGCC GAGCACATCT ACCGCACCGC CTCGCATCAC
GGAAAGCGCG TGCAGGCGCT GGGCGGCGCC AAGAACCACA TGATCGTCAT GCCCGACGCC
GATCTCGACC AGGCGGTCGA CGCGCTGATC GGCGCCGCCT ACGGCTCGGC CGGCGAGCGC
TGCATGGCGA TCTCGGTCGC CGTCGCGGTG GGCGGCATCG CCGACCCGCT GGTCGAGCGC
ATCGCGAGCC GGGCGCGCAG CCTGAAAGTC GGCCCCGGGC TCGACCCCGA GGCCGAGATG
GGGCCGCTGA TCACCAGGGA ACACCAGGCG AAGGTCGAAT CCTATATCGA GGCCGGCATC
GCGGAAGGGG CGCGCCTCGT CGTCGATGGC CGCGGCCTGA GATTGCAGGG CCACGAGGAC
GGCCATTTCG TCGGCCCGAC CCTGTTCGAC CATGTCCGGC CCGGGATGAA GATCCACCGC
GAGGAGATCT TCGGCCCGGT GCTCTCGGTG GTGCGCGAGG CGAATTTCGA TGGCGCGGTG
AGGATCATCA ACGAGCACGA ATTCGGCAAC GGCACCTCGA TCTTCACCCG CGATGGCGAC
GCCGCGCGCA GCTTCGCCGA CCAGATCGAG GTCGGCATGG TCGGCATCAA CGTGCCGATC
CCGGTGCCGA TGGCGTTCCA CTCCTTCGGC GGCTGGAAAC GCTCGGCCTT CGGCGATCAC
GGCATGCACG GCCATGAAGG CGTGCATTTC TACACCAAGC TGAAGACCAT GACCTCGCGC
TGGCCGACCG GCATCCGCGC CGGCGCCGAA TTCGCGATTC CGACCATGCG GTAA
 
Protein sequence
MKTIGHFINN ETVAGTGAER LPVFNPATGA QTGAVAVATA AEIEAAIAAA RAAFPAWAAT 
PPLRRARIMF RFKELVEQHK DELAKAITLE HGKVISDAGG EVVRGLEVVE FACGIPELLK
GDYSRNVGTN IDSYSFRAPL GVVAGVTPFN FPVMVPMWMF PMAIACGNCF ILKPSEKDPS
PGLLLAELLK EAGLPAGVFN VINGDRTAVD ALLTDPRIAA VSFVGSTPIA EHIYRTASHH
GKRVQALGGA KNHMIVMPDA DLDQAVDALI GAAYGSAGER CMAISVAVAV GGIADPLVER
IASRARSLKV GPGLDPEAEM GPLITREHQA KVESYIEAGI AEGARLVVDG RGLRLQGHED
GHFVGPTLFD HVRPGMKIHR EEIFGPVLSV VREANFDGAV RIINEHEFGN GTSIFTRDGD
AARSFADQIE VGMVGINVPI PVPMAFHSFG GWKRSAFGDH GMHGHEGVHF YTKLKTMTSR
WPTGIRAGAE FAIPTMR