Gene Noca_4626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4626 
Symbol 
ID4596082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4904236 
End bp4905732 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content72% 
IMG OID639779235 
Productmethylmalonate-semialdehyde dehydrogenase [acylating] 
Protein accessionYP_925808 
Protein GI119718843 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.228991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACGA CAATCCCGCA CTGGATCGAC GGCCTCCCGC AGCCGGGCGC GGCCACCCGC 
CTCGGCGACG TCACCAACCC CGCCACCGGA CAGGTGACCG GCCAGGTGGT CCTCGCGGAC
GGCGCCGACG TCGACGCCGC GGTCGCCTCG GCGAAGACGG CGTCGACCGC CTGGGCGCGG
ACCTCGATCA GCGCCCGCAC CCAGGTGGTG TTCCGGTTCC GCGAGCTGCT CAACGAGCGG
AAGGCCGAGC TGGCCGCGAT CATCACCGCC GAGCACGGCA AGGTGCTCTC GGACGCACTG
GGCGAGGTGT CCCGCGGCCA GGAGGTGGTC GAGTTCGCCT GCGGCATCCC GCACCTGCTC
AAGGGCGGCA ACAGCCCCCA GGTCTCGACC GGCGTCGACG TGCACAGCAT CCGCCAGCCC
CTCGGCGTGG TCGGCATCAT CAGCCCGTTC AACTTCCCCG CGATGGTCCC GATGTGGTTC
TTCCCGATCG CGATCGCGGC CGGCAACGCG GTCGTGCTCA AGCCCAGCGA GAAGGACCCC
TCCGCCGCGA TCTGGATGGC CCGGCTCTGG CAGGAGGCCG GCCTGCCCGA CGGCGTCTTC
ACGGTGCTGC AGGGCGACAA GGTCGCCGTG GACGGGCTGC TCGACCACCC GGACGTCGCG
GCGATCAGCT TCGTCGGCTC CACCCCGATC GCGGAGTACG TCTACGAGCG GGCGAGCCGC
ACCGGCAAGC GGGTGCAGGC CCTGGGCGGG GCGAAGAACC ACATGGTGGT GCTGCCCGAC
GCCGACCTCG ACCTGGCCGC CGATGCCGCG GTCAGCGCCG GCTACGGATC GGCCGGCGAG
CGGTGCATGG CGATCAGCGT GGTCGTCGCG GTCGGCGGCA CCGGCGACGA CCTGGTCGAG
CGGATCGCTG CACGCACCAC CGGGCTCCGC GTCGGCGACG GCACCCGCGA GTCGGACATG
GGGCCGCTGG TGACCGCGGC GCACCGCGAC AAGGTGGCGT CGTACGTCGA CGCCGGCGAG
GCCGAGGGCG CCGCCCTCGT CGTCGACGGG CGCAAGGTCG ACGCGGACGG CGAGCAGGAC
GGGCACTGGC TCGGACCGAC GCTGTTCGAC CACGTCACCC CGCAGATGAG CATCTACACC
GACGAGATCT TCGGCCCGGT GCTGAGCGTG GTCCGCGCCG AGACCTACGC GGAGGCGATC
GAGCTGGTCA ACGCGAACCG CTACGGCAAC GGCACCGCGA TCTTCACCGG CAACGGCGGC
GCCGCCCGCG CCTTCGAGCA GGACGTCGAG GTCGGCATGA TCGGCGTGAA CGTCCCGATC
CCGGTCCCGA TGGCGTACTA CTCCTTCGGC GGCTGGAAGG CCTCGCTGTT CGGCGACACC
CACGCCCACG GCATCGAGGG CGTGCACTTC TTCACCCGGG GCAAGGTCGT CACCACCCGC
TGGCCCGACC CCGGCAGCGC CGGCGGCCTC GAGCTGGCCT TCCCGAAGAA CCACTGA
 
Protein sequence
MTTTIPHWID GLPQPGAATR LGDVTNPATG QVTGQVVLAD GADVDAAVAS AKTASTAWAR 
TSISARTQVV FRFRELLNER KAELAAIITA EHGKVLSDAL GEVSRGQEVV EFACGIPHLL
KGGNSPQVST GVDVHSIRQP LGVVGIISPF NFPAMVPMWF FPIAIAAGNA VVLKPSEKDP
SAAIWMARLW QEAGLPDGVF TVLQGDKVAV DGLLDHPDVA AISFVGSTPI AEYVYERASR
TGKRVQALGG AKNHMVVLPD ADLDLAADAA VSAGYGSAGE RCMAISVVVA VGGTGDDLVE
RIAARTTGLR VGDGTRESDM GPLVTAAHRD KVASYVDAGE AEGAALVVDG RKVDADGEQD
GHWLGPTLFD HVTPQMSIYT DEIFGPVLSV VRAETYAEAI ELVNANRYGN GTAIFTGNGG
AARAFEQDVE VGMIGVNVPI PVPMAYYSFG GWKASLFGDT HAHGIEGVHF FTRGKVVTTR
WPDPGSAGGL ELAFPKNH