Gene Mmcs_5242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5242 
Symbol 
ID4114069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp5527874 
End bp5529814 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content68% 
IMG OID638034398 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_642399 
Protein GI108802202 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.860523 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACCGC ACAACCTCTT TCACTCCGGT TCGATGGAGA CGGTGGTGAA CCACATCATC 
GCCGACTGCA CCGACACCGT CGGCATCGAG GAAGGAGACG TCTTCATCAC CAACGACCCC
TACAAGGGCG TGTTGCACAT GCCGGACGTC ACCATGATCG AGCCGGTGTT CCACGAAGGT
GTCCGGATCG GATGGGTCGG GACGTGTGCG CATGTGCTCG ACATCGGCGG GATGACCCCG
TCGAGTTGGT CGCCGGCCGC GCGGGAGGTG TATCAGGAGG GCCTGATCCT GCCGCCCACG
AAGATCATCT CCGGTGGCCG GACCCGCCAC GACGTGTGGA ACCTGATCCT CGCGGCATCC
CGGCTGCCCG CCAACCTCGG GCTCGACCTC AAGGCGATGA TCGCCGCGAA CAACCATGCG
CGACAGGGCA TGCTGCGCCT AGTCGACCGC TACGGCGCCG ACGTCGTCAC CACGGTGATG
ACGACGATGC TCGACCGCTC CGAAGCCCAG GTGCGCGAGC GGTTGCTCGC TGTCCCGGAC
GGAACCACCC GGGCGCGTAC GTATTTCGAC CACGACGGGC ACAAGTCCAC GCTCGCGCGG
GTCGAGGTGG AGCTCCGCAA GGACGGCGAC CGGCTGGTGT TCGACTACGG CCGCACCGCC
GAACAGCTCC CGGGGTTCTT CAACTGCACG ATGTCCGGTC TGCGCGGCGG CGTGTTCTCG
GCGATCCTGC CGGTGCTGGC CCACGACATC CCGTGGAACA GCGGCGTCAT GCGGGCCATC
GAGGTGACCG CACCCGAGGG GACCATCGTC AACGCCAGGC ACCCGGCGCC GTGCGGTGCC
TCGACCACGG GCGCCACCAC GATCGTGGAG AGCACGGCGG GCAGCGCGTT GTCGACACTG
GTCAGCGCGC ACGACGAATT ACGTGGTGAG GCAAGGGCAG TCACCACGGG CGGGCTGATG
GTCTTCCACA TCGCCGGCCG CAACCAGTAC GGCGAACCGT ACGGCGGCGC GATGACCGAG
GTGCTGGCCG GGGGCGCCGG TGCCAACGTC AACCGCAACG GAGTCGACTA CCGCGGACCC
AACGAGATCC TCACCGGACA GTTCAACAAC GTCGAGGGCG AGGAGGCCGT CTTCCCGCTG
CTGTACCTGA ACCGGTCGGC CAACACCGAC GGTGGGGGAG CGGGCCGCCA TCACGGCGGT
GTGTCGGTGA GTTCGTCGTT CGTACTGCAC GACACCGATG CGCTGCACGG GGTGATGGCG
GGCCACAGCA TGTCGATGCC CAACTCGCTC GGCCTTCACG GCGGGCTACC GGGATCGACG
CATCAGGTGA CGATCGTGCG CGGGGGTGAG CCCGTGGTCT ACACCGGTTC TCCCGGGGAG
ATCCACCTTG CAGCAGGCGA TGTCGTCGAC TGGAGCTTCC ACGGCGGGGG CGGCTGGGGT
GACCCGCTCG ACGCCGACCC GGACGAGGTG CTGGCAGACG TCACGGCCGG CCGGATCTCC
ACCGAGAGCG CGTCGCGGCT CTACGGCGTG GTCCTGACGG CAGGGGCGGT GGACCGTGAG
GAGACCACCG CGCGCCGGAA CCGGGAACGC GCCCGTCGTC GACTGTGGGC CCGGCACAGG
GTGTTCGGTG CGCGGCAGCT CGACATGGTC GGTGACGCGC GCCGGATCGG TGACCGTCTC
GTGCTGACGC ACGACGGTGT GTCGGCGGTG TACGCGTGCG ACTGCGGGAA CGTGATCGCC
CCTGCCGACG AGAACTGGAA GGACTACGCG GCGCATTCCC GGTTGAGCGA ACACGATCTG
GGACCGAAGG TCCGGTTGCA TCCGGGTCTG CGCGCCGACG CGTATGCGTG CGCGGGATGC
GGCACGCTGC TGGCGGTCGA GATCCGGGCA CACGAGGACG AACCGCTGCG CGATCTGGAG
TTGGCCGGGC AGAGCGGCTG A
 
Protein sequence
MGPHNLFHSG SMETVVNHII ADCTDTVGIE EGDVFITNDP YKGVLHMPDV TMIEPVFHEG 
VRIGWVGTCA HVLDIGGMTP SSWSPAAREV YQEGLILPPT KIISGGRTRH DVWNLILAAS
RLPANLGLDL KAMIAANNHA RQGMLRLVDR YGADVVTTVM TTMLDRSEAQ VRERLLAVPD
GTTRARTYFD HDGHKSTLAR VEVELRKDGD RLVFDYGRTA EQLPGFFNCT MSGLRGGVFS
AILPVLAHDI PWNSGVMRAI EVTAPEGTIV NARHPAPCGA STTGATTIVE STAGSALSTL
VSAHDELRGE ARAVTTGGLM VFHIAGRNQY GEPYGGAMTE VLAGGAGANV NRNGVDYRGP
NEILTGQFNN VEGEEAVFPL LYLNRSANTD GGGAGRHHGG VSVSSSFVLH DTDALHGVMA
GHSMSMPNSL GLHGGLPGST HQVTIVRGGE PVVYTGSPGE IHLAAGDVVD WSFHGGGGWG
DPLDADPDEV LADVTAGRIS TESASRLYGV VLTAGAVDRE ETTARRNRER ARRRLWARHR
VFGARQLDMV GDARRIGDRL VLTHDGVSAV YACDCGNVIA PADENWKDYA AHSRLSEHDL
GPKVRLHPGL RADAYACAGC GTLLAVEIRA HEDEPLRDLE LAGQSG