Gene Mmcs_1220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1220 
Symbol 
ID4110057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1319422 
End bp1320993 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content69% 
IMG OID638030341 
Productgamma-aminobutyraldehyde dehydrogenase 
Protein accessionYP_638388 
Protein GI108798191 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03374] 1-pyrroline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGTATG CGCAGCAGGG GGGATGTGGG TCCGATAGCG TTGGACCCAT GACAGCGCTT 
GCGTCGAACG CCTCGGCGTC CAACTCAGTG GCCAGCAGCT GGATCGATGG TGCGCCGGTG
CAGACCGGCG GCGGTTCGTT CCAGATCGTC AACCCCGCGA CCGGTGCGGT CGTCACAGAA
TATGCCCGCG CGGCGAACGT CGACGTGGAT GTGGCCGTCG CCGCGGCCCG GGCGGCCCTG
CCCGGATGGG CGACCGCGAC CCCGGCCGAA CGTTCGGCGG TGCTGGCCAA GCTGGCGAAG
CTGGCCGACG AGCACACCGA CGTGCTGGTC GCCGAGGAGG TCAGCCAGAC CGGGAAGCCG
GTGCGGCTGG CCCGCGAGTT CGACGTGCCC GGCAGCGTCG ACAACATCGA CTTCTTCGCA
GGCGCGGCCC GGCACCTGGA GGGCAAGGCG ACCGCCGAGT ACTCCGGTGA TCACACCTCG
AGCATCCGGC GGGAGGCGAT CGGCGTCGTC GCGACCATCA CCCCCTGGAA CTATCCGCTT
CAGATGGCGG TGTGGAAGGT GCTGCCCGCG CTGGCCGCCG GCTGCACCGT GGTGATCAAG
CCCTGCGAGC TGACGCCGCT GACCACGCTG ACGTTGGCGC GGCTGGCCGG TGAGGCGGGG
CTGCCGCCAG GTGTGCTCAA CGTGGTCACC GGTTTCGGCG CCGACGTGGG CACCGCACTG
GCGGGGCACC CCGGGGTCGA TCTGGTGACG TTCACCGGGT CGACGGTCGT GGGCCGCAAG
GTGATGTCGG CGGCCGCGGT GCACGGGCAC CGCACGCAAC TCGAACTCGG CGGCAAGGCC
CCGTTCGTGG TGTTCGACGA CGCGGACCTC GACGCCGCGA TCCAGGGAGC GGTCGCCGGT
TCGCTGATCA ACTCCGGCCA GGACTGCACG GCCGCGACGC GCGCGATCGT GGCCCGTGAT
CTCTACGACG ACTTCGTGGC CGGCGTCGCC GAGGTGATGG GCAAGATCGT CGTCGGCGAT
CCGGAGGATC CCGACACCGA CCTCGGTCCG CTGATCTCGA CGGCCCACCG CGACAAGGTG
GCGGGCATGG TGTCGCGGGC ACCGGCCGAG GGCGGTCGCG TCGTGACCGG CGGGACGATC
CCCGATCTGC CGGGTTCGTT CTACCGCCCG ACCCTGATCG CCGACGTCGG CGAGCAGTCC
GAGGTGTACC GCGAAGAGAT CTTCGGTCCG GTGCTCACGG TGCGCTCGTT CACCGACGAC
GACGATGCGC TGCGCCAGGC CAACGACACC GATTACGGGT TGGCGGCCTC TGCGTGGACG
CGTGACGTCT ATCGCGCGCA GCGGGCGTCC CGTGAGATCA ACGCCGGTTG CGTGTGGATC
AACGACCACA TCCCGATCAT CAGCGAGATG CCGCACGGCG GCGTGGGCGC ATCAGGCTTC
GGCAAGGACA TGAGCGACTA CTCGTTCGAG GAGTACCTCA CGATCAAGCA CGTGATGAGC
GACATCACCG GGGTGGCCGA CAAGGAGTGG CACCGGACGG TGTTCAAGAA GCGAAGCGGA
TTGAACCATT AG
 
Protein sequence
MRYAQQGGCG SDSVGPMTAL ASNASASNSV ASSWIDGAPV QTGGGSFQIV NPATGAVVTE 
YARAANVDVD VAVAAARAAL PGWATATPAE RSAVLAKLAK LADEHTDVLV AEEVSQTGKP
VRLAREFDVP GSVDNIDFFA GAARHLEGKA TAEYSGDHTS SIRREAIGVV ATITPWNYPL
QMAVWKVLPA LAAGCTVVIK PCELTPLTTL TLARLAGEAG LPPGVLNVVT GFGADVGTAL
AGHPGVDLVT FTGSTVVGRK VMSAAAVHGH RTQLELGGKA PFVVFDDADL DAAIQGAVAG
SLINSGQDCT AATRAIVARD LYDDFVAGVA EVMGKIVVGD PEDPDTDLGP LISTAHRDKV
AGMVSRAPAE GGRVVTGGTI PDLPGSFYRP TLIADVGEQS EVYREEIFGP VLTVRSFTDD
DDALRQANDT DYGLAASAWT RDVYRAQRAS REINAGCVWI NDHIPIISEM PHGGVGASGF
GKDMSDYSFE EYLTIKHVMS DITGVADKEW HRTVFKKRSG LNH