Gene Mkms_1237 details


Gene Information

Locus tag: Mkms_1237
Symbol:
ID: 4614433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 1323731
End bp: 1325302
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 69%
IMG OID: 639790912
Product: gamma-aminobutyraldehyde dehydrogenase
Protein accession: YP_937239
Protein GI: 119867287
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR03374] 1-pyrroline dehydrogenase
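
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon (the gene sequence in this record ends in TAG). The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1323731
end_bp = 1325302
gene_length_bp = 1572
protein_length_aa = 523

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is the stop codon,
# which is not translated into an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree with one another: 1572 bp is 524 codons, one of which is the stop codon.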


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.399844
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.627262
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGTATG CGCAGCAGGG GGGATGTGGG TCCGATAGCG TTGGACCCAT GACAGCGCTT 
GCGTCGAACG CCTCGGCGTC CAACTCAGTG GCCAGCAGCT GGATCGATGG TGCGCCGGTG
CAGACCGGCG GCGGTTCGTT CCAGATCGTC AACCCCGCGA CCGGTGCGGT CGTCACAGAA
TATGCCCGCG CGGCGAACGT CGACGTGGAT GTGGCCGTCG CCGCGGCCCG GGCGGCCCTG
CCCGGATGGG CGACCGCGAC CCCGGCCGAA CGTTCGGCGG TGCTGGCCAA GCTGGCGAAG
CTGGCCGACG AGCACACCGA CGTGCTGGTC GCCGAGGAGG TCAGCCAGAC CGGGAAGCCG
GTGCGGCTGG CCCGCGAGTT CGACGTGCCC GGCAGCGTCG ACAACATCGA CTTCTTCGCA
GGCGCGGCCC GGCACCTGGA GGGCAAGGCG ACCGCCGAGT ACTCCGGTGA TCACACCTCG
AGCATCCGGC GGGAGGCGAT CGGCGTCGTC GCGACCATCA CCCCCTGGAA CTATCCGCTT
CAGATGGCGG TGTGGAAGGT GCTGCCCGCG CTGGCCGCCG GCTGCACCGT GGTGATCAAG
CCCTGCGAGC TGACGCCGCT GACCACGCTG ACGTTGGCGC GGCTGGCCGG TGAGGCGGGG
CTGCCGCCAG GTGTGCTCAA CGTGGTCACC GGTTTCGGCG CCGACGTGGG CACCGCACTG
GCGGGGCACC CCGGGGTCGA TCTGGTGACG TTCACCGGGT CGACGGTCGT GGGCCGCAAG
GTGATGTCGG CGGCCGCGGT GCACGGGCAC CGCACGCAAC TCGAACTCGG CGGCAAGGCC
CCGTTCGTGG TGTTCGACGA CGCGGACCTC GACGCCGCGA TCCAGGGAGC GGTCGCCGGT
TCGCTGATCA ACTCCGGCCA GGACTGCACG GCCGCGACGC GCGCGATCGT GGCCCGTGAT
CTCTACGACG ACTTCGTGGC CGGCGTCGCC GAGGTGATGG GCAAGATCGT CGTCGGCGAT
CCGGAGGATC CCGACACCGA CCTCGGTCCG CTGATCTCGA CGGCCCACCG CGACAAGGTG
GCGGGCATGG TGTCGCGGGC ACCGGCCGAG GGCGGTCGCG TCGTGACCGG CGGGACGATC
CCCGATCTGC CGGGTTCGTT CTACCGCCCG ACCCTGATCG CCGACGTCGG CGAGCAGTCC
GAGGTGTACC GCGAAGAGAT CTTCGGTCCG GTGCTCACGG TGCGCTCGTT CACCGACGAC
GACGATGCGC TGCGCCAGGC CAACGACACC GATTACGGGT TGGCGGCCTC TGCGTGGACG
CGTGACGTCT ATCGCGCGCA GCGGGCGTCC CGTGAGATCA ACGCCGGTTG CGTGTGGATC
AACGACCACA TCCCGATCAT CAGCGAGATG CCGCACGGCG GCGTGGGCGC ATCAGGCTTC
GGCAAGGACA TGAGCGACTA CTCGTTCGAG GAGTACCTCA CGATCAAGCA CGTGATGAGC
GACATCACCG GGGTGGCCGA CAAGGAGTGG CACCGGACGG TGTTCAAGAA GCGAAGCGGA
TTGAACCATT AG
 
Protein sequence
MRYAQQGGCG SDSVGPMTAL ASNASASNSV ASSWIDGAPV QTGGGSFQIV NPATGAVVTE 
YARAANVDVD VAVAAARAAL PGWATATPAE RSAVLAKLAK LADEHTDVLV AEEVSQTGKP
VRLAREFDVP GSVDNIDFFA GAARHLEGKA TAEYSGDHTS SIRREAIGVV ATITPWNYPL
QMAVWKVLPA LAAGCTVVIK PCELTPLTTL TLARLAGEAG LPPGVLNVVT GFGADVGTAL
AGHPGVDLVT FTGSTVVGRK VMSAAAVHGH RTQLELGGKA PFVVFDDADL DAAIQGAVAG
SLINSGQDCT AATRAIVARD LYDDFVAGVA EVMGKIVVGD PEDPDTDLGP LISTAHRDKV
AGMVSRAPAE GGRVVTGGTI PDLPGSFYRP TLIADVGEQS EVYREEIFGP VLTVRSFTDD
DDALRQANDT DYGLAASAWT RDVYRAQRAS REINAGCVWI NDHIPIISEM PHGGVGASGF
GKDMSDYSFE EYLTIKHVMS DITGVADKEW HRTVFKKRSG LNH
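
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any computation. The following is a minimal sketch of how that could be done and how a GC percentage such as the one listed in this record could be recomputed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo input is just the last line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the gene sequence in this record, used only as a short demo.
tail = clean_sequence("TTGAACCATT AG")
print(tail, len(tail), round(gc_percent(tail), 1))
```

To check the GC content field, the whole gene sequence block would be pasted into `clean_sequence` in place of the short demo string.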