Gene Mkms_3788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3788 
Symbol 
ID4611723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4001921 
End bp4003174 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content70% 
IMG OID639793468 
Productamidohydrolase 
Protein accessionYP_939771 
Protein GI119869819 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.463322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.142834 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGTC CGACCGTTCT CAAAGCGGCC CGCTGGGCCG ACGTCGAGGC CGGCGTCGTC 
CGCGCACCGG CCGTCGTGGT GATCGAGGGT AACCGCATCC AGTCCGTGAA TCCCGCTGAG
CCGCCGCAGA ATCCGGCTCA GGAGATCGAC CTCGGCGACG TCACCCTGCT GCCCGGCCTG
ATGGACATGG AGCTGAACCT GCTCATCGGC GGACCCGGGG GGCCGGAGGG TCTGCCCAGT
CCGATGCACG GGGTGCAGGA CGACCCGGTG TACCGCACGT TGCGGGCGGC GGTGAACGCC
CGCACCACAC TCGACGCCGG ATTCACCACC GTGCGCAATC TGGGGCTGAT GGTCAAGACC
GGCGGCTACC TGCTCGATGT GGCGCTCCAA CGCGCCGTCG ACCAGGGCTG GCACGCCGGC
CCGCGGATCT ACCCGGCCGG CCACGCCGTC ACCCCGTACG GCGGCCACCT GGATCCGACG
GTCTTCCAGC GCCTGGCACC GGGGATCATG CCGCTGTCGG TGGCCGAGGG GATCGCCAAC
GGCGTCGACG ACGTGCGGAC CTGCGTGCGT TACCAGATCC GCCACGGCGC CAAGTTGATC
AAGGTCTCGG CCTCCGGTGG GGTGATGTCG CACAGCACCG CCCCCGGCGC GCAGCAGTAC
TCCGACGACG AGTTCGCCGC GATCGCCGAC GAGGCCCACC GCGCCGGGGT ACGGGTCGCC
GCACATGCGG TGGGGGACAG CGCGATCCGC GCCTGTATCC GCGCCGGGAT CGACTGCATC
GAACACGGCT TCCTTGCCAC GGACGAGACG ATCCAGATGA TGGTCGATCA CGGCACGTTC
CTCGTCTCGA CCACCTATCT CACCGAGGCG ATGGCGGTCG ACCGCATCGC ACCCGAGCTG
CGCCGCAAGG CCGAGGAGGT GTTTCCCCGG GCTCAGGCGA TGCTGCCGAA GGCGATCGCC
GCCGGTGTGC GCATCGCGTG CGGCACCGAC GCCCCGGCGG TGCCGCACGG ACAGAACGCC
AAAGAGCTGT GTGCGCTCGT GTCCCGGGGC ATGACGCCCA TGCAGGCGCT GCGCGCGGCG
ACCATCACGT CCGCAGAGCT CATCGAGGCC GACGGCGAAC TCGGCCGGCT CGCCCCCGGC
TATCTCGCCG ACATCATCGC GGTGCCCGGC GATCCGTCGA GCGACATCGC GACCACGCTC
GACGTGCGGT TCGTGATGAA GGACGGTGTC GTCCACAAGC GCGGCACCGT CTGA
 
Protein sequence
MTGPTVLKAA RWADVEAGVV RAPAVVVIEG NRIQSVNPAE PPQNPAQEID LGDVTLLPGL 
MDMELNLLIG GPGGPEGLPS PMHGVQDDPV YRTLRAAVNA RTTLDAGFTT VRNLGLMVKT
GGYLLDVALQ RAVDQGWHAG PRIYPAGHAV TPYGGHLDPT VFQRLAPGIM PLSVAEGIAN
GVDDVRTCVR YQIRHGAKLI KVSASGGVMS HSTAPGAQQY SDDEFAAIAD EAHRAGVRVA
AHAVGDSAIR ACIRAGIDCI EHGFLATDET IQMMVDHGTF LVSTTYLTEA MAVDRIAPEL
RRKAEEVFPR AQAMLPKAIA AGVRIACGTD APAVPHGQNA KELCALVSRG MTPMQALRAA
TITSAELIEA DGELGRLAPG YLADIIAVPG DPSSDIATTL DVRFVMKDGV VHKRGTV