Gene Arth_0420 details


Gene Information

Locus tag: Arth_0420
Symbol:
ID: 4447115
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 448971
End bp: 450470
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 66%
IMG OID: 639688219
Product: methylmalonate-semialdehyde dehydrogenase [acylating]
Protein accession: YP_829921
Protein GI: 116668988
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
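
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record itself does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 448971
END_BP = 450470
GENE_LENGTH_BP = 1500
PROTEIN_LENGTH_AA = 499


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks print True when the record's fields agree with each other.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the fields agree: 450470 - 448971 + 1 = 1500 bp, and 1500 / 3 - 1 = 499 aa.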


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCGCG AGCTTTCCCA TTACATAGAC GGGCAACGGG TCCACGGCAC CTCCGGGCGC 
TTCGGCGACG TCTACGATCC CTGCACCGGC GAGGTGCAGG CCAGGCTGCC GCTGGCCAGC
GCCGACGAAG TCCGCAATGC CATTGCCAGC GCGGAGAAGG CCCAGGTGGA GTGGGGTGCC
ATGAATCCCC AGCGTCGCGG CCGGATCCTG CTGAAGTTCG TGGACCTGGT GAACGAGCAC
CTGGACGAAC TCGCCGCGCT GCTCTCCTCG GAACATGGCA AGACGCTTCC GGACGCCAAG
GGGGACATCC AGCGCGGCAT CGAGGTGGTG GAGTTCGCGG CCGGCGCCCC GCACCTGCTC
AAAGGCGAAT TCTCGGACAA TGCCGGGACC GGAATTGACG TACATTCGCT GCGCCAGCCG
CTGGGCGTGG TGGCCGGAAT TACGCCGTTC AACTTCCCTG CCATGATTCC GCTGTGGAAG
TCCGGACCTG CCTTGGCTGC TGGCAACGCC TTCATCCTCA AACCCTCGGA GCGTGACCCC
TCCGTGCCGC TGCGCCTCGC GGAGCTCTTC ACCGAAGCAG GCGTCCCGGA CGGTGTGTTC
AACGTGATCA ACGGGGACAA GGAAGCAGTG GATGCGCTGC TCGAGGACGA GAGGGTCAAA
GCGATCGGAT TCGTGGGTTC CACGCCCATT GCCCAGTACA TCTACGCCAC CGCAGCCGCC
CACGGAAAGC GCGCACAGTG CTTCGGCGGG GCCAAGAACC ACATGGTGAT CATGCCGGAC
GCGGACCTGG ACATGGCCGT GGATGCGCTG ATGGGTGCGG GTTACGGCTC CGCCGGCGAA
CGGTGCATGG CCATCTCCGT TGCGGTCCCG GTGGGCAAAG AAACGGCCGA CGCCCTGGTC
TCCAAGCTGG AGGAACGGGT CAAGCACCTC AAGGTGGGGC ACAGCCTGGA CAAGGACTCG
GACTTCGGCC CGGTAGTGGC GGCGTCCGCC AAGGAGCGCA TCGAGGGCTA CATCCAGTCC
GGCGTGGACC AGGGCGCCAC CCTCGTGGCG GACGGCCGCG GCCTCACCGT GGACGGCTAC
GGCGGCGGGT TCTGGGTTGG CCCCACGCTC TTCGACAACG TCACCAAGGA CATGAAGATC
TACAAGGAGG AAATCTTCGG CCCGGTGCTC AGCGTCCTCC GCGCGGCGGA CTACGACGAA
GCCCTCCGCC TCTGCAGCGA GCACGAGTTC GGCAACGGCG TGGCCATCTT CACCCGCGAC
GGCGACGCTG CCCGTGACTT CGCCAGCCGG GTGCAGGTGG GTATGGTGGG CATCAACGTG
CCCATCCCGG TGCCCATTGC CTACTACACG TTCGGCGGCT GGAAGGCCTC CGGCTTCGGG
GACCTGAACC AGCACGGGGC CGACGCGTTC CGTTTCTACA CCAAGACCAA GACCGTGACC
ACCCGCTGGC CCTCCGGCAT CCGCCAGGGC GCCAGCTACG TGATGCCGGA AGGCAGCTGA
 
Protein sequence
MVRELSHYID GQRVHGTSGR FGDVYDPCTG EVQARLPLAS ADEVRNAIAS AEKAQVEWGA 
MNPQRRGRIL LKFVDLVNEH LDELAALLSS EHGKTLPDAK GDIQRGIEVV EFAAGAPHLL
KGEFSDNAGT GIDVHSLRQP LGVVAGITPF NFPAMIPLWK SGPALAAGNA FILKPSERDP
SVPLRLAELF TEAGVPDGVF NVINGDKEAV DALLEDERVK AIGFVGSTPI AQYIYATAAA
HGKRAQCFGG AKNHMVIMPD ADLDMAVDAL MGAGYGSAGE RCMAISVAVP VGKETADALV
SKLEERVKHL KVGHSLDKDS DFGPVVAASA KERIEGYIQS GVDQGATLVA DGRGLTVDGY
GGGFWVGPTL FDNVTKDMKI YKEEIFGPVL SVLRAADYDE ALRLCSEHEF GNGVAIFTRD
GDAARDFASR VQVGMVGINV PIPVPIAYYT FGGWKASGFG DLNQHGADAF RFYTKTKTVT
TRWPSGIRQG ASYVMPEGS
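
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It strips the 10-character block spacing, computes a GC fraction, and translates using the standard codon assignments, which translation table 11 shares (table 11 differs only in the start codons it permits). The helper names `clean_sequence`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 uses these same codon-to-amino-acid assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate in frame 1; unknown codons (e.g. containing N) raise KeyError.

    The first codon is translated by the table like any other, so an
    alternative start codon such as GTG would not be shown as M here.
    """
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    head = "ATGGTTCGCG AGCTTTCCCA TTACATAGAC"
    print(translate(head))  # MVRELSHYID
```

Pasting the full gene sequence into `translate` and comparing the result with `clean_sequence` of the protein sequence gives an end-to-end consistency check; `gc_fraction` on the same input can be compared with the GC content field.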