Gene Mmcs_1841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1841 
Symbol 
ID4110675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1983115 
End bp1984851 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content73% 
IMG OID638030961 
Productallophanate hydrolase 
Protein accessionYP_639006 
Protein GI108798809 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID[TIGR02713] allophanate hydrolase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCCG TCGAGCGGGT CCGCGCCGCC TACGCCACGA TCGAGGCCGT CGCGCGGCCC 
GAGGTCTGGA TCTTCCTGCG GCCGTTCGCC GATGCGCTCA CCGACGCCGA GGCCGTCGAC
TCCGCCGTCG CGGCCGGGGC GGACCTGCCG CTGGCCGGGC TGACCGTCGC GGTCAAGAAC
AACGTCGACG TCGCGGGCCT GCCCACGACG GCGGGCTGCC CGGGATACGC AACCGACCCG
GCCGAGGTGG ACGCCCCGGT GGTCGCACGG CTGCGCGCCT CGGGTGCGGT GGTGCTCGGG
GCGACCAATC TCGATCAGTT CGCGACCGGA CTCGTCGGCA CCCGCAGCCC ACACGGCGCC
GTGCGCGACG CCCGCCGCCC CGGCCACATC TCGGGCGGTT CGAGTTCGGG ATCGGCGGTC
GCGGTGGCAC TCGGCATCGC CGACCTCGCC ATCGGCACGG ACACCGCCGG ATCCGGCCGC
GTGCCTGCGG CGCTGCAGGG CATCGTGGGC ATCAAACCCA CCTACGGTGT GGTGCCCACC
GACGGGGTGG TGCCCGCCTG CCGAAGCTAC GACTGCGTCA CGGTGTTCGC GCGCGATCTC
GACACCGCCG ACGCCGCCAT GGGGGTGATG GCGGGTGCGG ACCCGTCGGC GGGCGTGCGG
GCGCGGCCCT TCCCGCCCGA CGCCCCGCTC GCGGCGCCGG CCGTGCCGCT GGTCGGCGTT
CCGCGCGACC TCCCCGGCCT CTCACCGGCC TGGCGGCAGG CGTTCGGCGA GGCGAGGTCC
CGCCTCGAAG GACAGGGTGC GGCCGTGCGC GAGATCGACA TGCGCGCATT CCTCGAAGCG
GCCAGGCTGC TCTACGACGG CGGGCTCGTC GCCGAACGGC ACGAAGCCGT CGGCGATTTC
GTCGACACAC ACCGCGACGA GGTCGACCCC ACCGTCGGTG CGATCATCGC GGCGGCCGGT
ACGGTACCGG CCACCCGGCT GCTTCGGGAC CGGGTACGGC TGGCCGAACT CACCGCCGCG
GCGATGGCCG AACTCGGTGA CTGCGACGCG CTGCTGATCC CGACGACCAC CGACCACCCG
ACCATCGCCG AGGTCGAGGC CGAGCCGATC GCGGTCAACT CACGACTGGG CACGTACACG
AACTTCTGCA ATCTGCTCGA CATGTGCGCG GTCGCCGTGC CGTCCGGAGC CGCCGACGGC
GCCCAGTTCG GTGTCTCGAT CGTCGCCCGG GCGGGCGCCG ACGCGGTGGC GCTCGACCTG
GCGCGCCGTG TCACATCCGG ATCCTCGGAT CCTGTTGTGT CGCAGGCACC GTGGCCGGTC
CGCGCGGGTC TGTCCGCGAC ACCGCTGCTC GTGGTGGGCG CGCACCTGCG CGATCAGCCG
CTGGCGTGGC AACTCGAAGA ACGCGGGGCG CGGTGGCTCG GGCCGGCGGT CACCGCCCCG
CTCTACCGGC TCGCGCGCCT GCACACCACG CCGGCCAAAC CGGGTCTGGT CCGGGTGGGC
GCCGAGAGCG GTACGGCGAT CGGCGGTGAG CTGTGGCTGG TCGGGACCGC GATGCTGGGC
GACTTCCTCG CCGCGCTGCC GTCACCGATG ATGCTCGGCC GGGTGACGCT GTCCGACGGC
ACCGAGGTCG TCGGGTTCGG ATGCGCACTC GACGCGTGGC AGAGCGGTGA GGACATCTCG
TCCTACGGCG ACTGGCGGAA CTACCTGGGC TCCGGTCAGC TCAGCGAGCG CGTGTAA
 
Protein sequence
MTAVERVRAA YATIEAVARP EVWIFLRPFA DALTDAEAVD SAVAAGADLP LAGLTVAVKN 
NVDVAGLPTT AGCPGYATDP AEVDAPVVAR LRASGAVVLG ATNLDQFATG LVGTRSPHGA
VRDARRPGHI SGGSSSGSAV AVALGIADLA IGTDTAGSGR VPAALQGIVG IKPTYGVVPT
DGVVPACRSY DCVTVFARDL DTADAAMGVM AGADPSAGVR ARPFPPDAPL AAPAVPLVGV
PRDLPGLSPA WRQAFGEARS RLEGQGAAVR EIDMRAFLEA ARLLYDGGLV AERHEAVGDF
VDTHRDEVDP TVGAIIAAAG TVPATRLLRD RVRLAELTAA AMAELGDCDA LLIPTTTDHP
TIAEVEAEPI AVNSRLGTYT NFCNLLDMCA VAVPSGAADG AQFGVSIVAR AGADAVALDL
ARRVTSGSSD PVVSQAPWPV RAGLSATPLL VVGAHLRDQP LAWQLEERGA RWLGPAVTAP
LYRLARLHTT PAKPGLVRVG AESGTAIGGE LWLVGTAMLG DFLAALPSPM MLGRVTLSDG
TEVVGFGCAL DAWQSGEDIS SYGDWRNYLG SGQLSERV