Gene Mkms_4867 details

Gene Information

Locus tag: Mkms_4867
Symbol:
ID: 4616282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 5094793
End bp: 5095803
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 70%
IMG OID: 639794558
Product: allantoicase
Protein accession: YP_940847
Protein GI: 119870895
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG4266] Allantoicase
TIGRFAM ID: [TIGR02961] allantoicase
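
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 5094793
END_BP = 5095803
GENE_LENGTH_BP = 1011
PROTEIN_LENGTH_AA = 336


def span_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1011
print(coding_codons(GENE_LENGTH_BP))   # 336
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.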


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACCG CGCATTCACA TTTCCTTGCA CTGCCCGATC TGGCCTCCCG CGCGGCCGGC 
GCCGCCGTCG TGTGGGCCAA CGACGAACTG TTCGCCGAGA AGGAGAACCT GATCATCCCG
GCGGCCGCCC AGCACCGGCC TGCGACGTTC GGCCACAAGG GTCAGGTGTA CGACGGGTGG
GAGACGCGCC GCAGCCGCGG CGCCGACAGG GAGTCCTGCG ATTCGGCGAT CATCCGGCTC
GGTGCACCCG GCGTCGTACG CGGCGTGGTC GTGGACACCG CGTGGTTCAC CGGCAACTAT
CCGCCGGAGG TGTCGGTCGA GGCCGCCTAC GTTCCCGGCC ATCCTTCGAT TGAGGAACTG
CTCGGCCGGC ACTGGACGAC GATCGTCGAG CGCAGCCCGG TCGACGGCGA CACCCGCAAT
CCGTTCGAGG TGGCCTCGGG TCGCCGCTGG TCGCACGTGA GGTTGTCGAT GTATCCCGAC
GGCGGTGTGG CCCGGCTTCG TGTGCACGGT GAGGGCCTGC TCGACCCGCA GTTCGCCGAA
TTACCGCTCG ACCTCGCGGC GTTGGAGCAC GGTGGCCGAA TCACGGGCTG CTCCAACATG
TTCTACAGCT CGCCGAACAA TCTGCTGCTG CCGGGCACGG CCCGCACCAT GGGCGACGGT
TGGGAGACCT CGCGGCGCCG CGACGCCGGA AACGACTGGG TGCAGGTGCG ATTGGCGGCC
AGGGGCGTGT TGAGCTTCGC CGAACTCGAC ACGTCGTATT TCATCGGCAA CGCGCCCGGT
TCCGCGCGGC TGCGGGGACG CGACGGTGAC GGCGAATGGT TCGACCTGTT GGACGAGGTG
GCGTTGCAGC CCGACACCCG GCACCGCTTC CTCATCGCGT CCGAGCGGCC GGTGACCGAA
GCGCGCCTCG ACGTGTTCCC CGACGGGGGG ATGGCGCGGC TGCGCCTGTT CGGCCGGCTC
ACCGACGACG GCCTGCGCGC AGTCGCCGAG CGGTGGAGCC TCACCGGCTG A
 
Protein sequence
MSTAHSHFLA LPDLASRAAG AAVVWANDEL FAEKENLIIP AAAQHRPATF GHKGQVYDGW 
ETRRSRGADR ESCDSAIIRL GAPGVVRGVV VDTAWFTGNY PPEVSVEAAY VPGHPSIEEL
LGRHWTTIVE RSPVDGDTRN PFEVASGRRW SHVRLSMYPD GGVARLRVHG EGLLDPQFAE
LPLDLAALEH GGRITGCSNM FYSSPNNLLL PGTARTMGDG WETSRRRDAG NDWVQVRLAA
RGVLSFAELD TSYFIGNAPG SARLRGRDGD GEWFDLLDEV ALQPDTRHRF LIASERPVTE
ARLDVFPDGG MARLRLFGRL TDDGLRAVAE RWSLTG
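
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the database's own method. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares, and it treats ATG, GTG and TTG at the first position as initiators read as methionine (an assumed subset of the start codons that table 11 allows). All names in the block are hypothetical. The GC helper is included so that the full gene sequence can be checked against the GC content field.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiator codon is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above.
first_line = "GTGAGCACCG CGCATTCACA TTTCCTTGCA CTGCCCGATC TGGCCTCCCG CGCGGCCGGC"
print(translate(first_line))  # MSTAHSHFLALPDLASRAAG
```

The printed residues match the first 20 residues of the protein sequence above, including the leading M that comes from the GTG start codon.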