Gene Mkms_4989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4989 
Symbol 
ID4612666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5228865 
End bp5229857 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content70% 
IMG OID639794681 
ProductDNA polymerase III subunit epsilon 
Protein accessionYP_940968 
Protein GI119871016 
COG category[L] Replication, recombination and repair 
COG ID[COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases 
TIGRFAM ID[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGCC ACGGTTGGGG AAGACCGGCG GTCGACACCG GTACAGGCTG GGCCGTCGTC 
GATGTCGAGA CGTCGGGTTT CCGGCCCGGG CAGGCGCGCA TCGTCAGCCT GGCCGCACTC
GCGGTGGGTG ACGACGGCAA CGTCGAACAG AGCCTGGCCA CCCTGCTGAA TCCGGGTGTC
GACCCGGGGC CCACGCATGT GCACGGGCTG ACCGCCGAGA TGCTCGAGGG TGCGCCCCGC
TTCGGTGACG TCGTCGCCGA CCTCGCCGAA CTGCTGCGCG GTCGCACGCT CGTCGCGCAC
AACGTCGGAT TCGACTACTC GTTCCTGACC GCCGAGGCCG AACTCGTCGG CGCGGAACTG
CCGATCGACT CGGTGATGTG CACCGTCGAA CTCGCCCGCC GCCTCGACCT GGGGACGGAG
AACCTGCGGT TGGAGACCCT CGCGGCGCAC TGGGGTGTGC CGCAACTCAA ACCGCACGAT
GCGCTCGACG ACGCTCAGGT CCTCGCGCAG ATCCTCAAAC CGACGCTGGC GCGCGCCCGC
GAGCGCAGGG CCTGGCTGCC GACGCGTTCG GTGAGCCGGC GGCGGTGGCC CAACGGCCGG
GTCACCCACG ACGACCTGCA CCCGTTGAGG ATGGTGGCCG CGCGGCTGCC CTGCGCGTAC
CTGAATCCCG GCCGCTACAT CGCGGGCCGC CCGCTGGTGA AAGGTATGCG CGTCGCGGTC
GCCGCGGAGG TCACGCGCAC TTACGAAGAG CTGATCGAGC GGTTGCTCAC CGCCGGGCTG
GCCTACACCG ACGCGGTGGA CACGGAGACC TCACTGGTCA TCTGCAACCA GCCCGATGTC
GAACAGGGCA AGGGCTACCA GGCTCAGGAG CTCGGCGTCC CGGTGCTCTC GGACGCCGAC
TTCCTGCGGG CCCTCGACCA CGTCGTCGGG GGCACCGGTA TCGAGGAGTT CTTCGACGCC
ACCACGGTCG GCGATCAGTT CGCGCTGTTC TAG
 
Protein sequence
MVSHGWGRPA VDTGTGWAVV DVETSGFRPG QARIVSLAAL AVGDDGNVEQ SLATLLNPGV 
DPGPTHVHGL TAEMLEGAPR FGDVVADLAE LLRGRTLVAH NVGFDYSFLT AEAELVGAEL
PIDSVMCTVE LARRLDLGTE NLRLETLAAH WGVPQLKPHD ALDDAQVLAQ ILKPTLARAR
ERRAWLPTRS VSRRRWPNGR VTHDDLHPLR MVAARLPCAY LNPGRYIAGR PLVKGMRVAV
AAEVTRTYEE LIERLLTAGL AYTDAVDTET SLVICNQPDV EQGKGYQAQE LGVPVLSDAD
FLRALDHVVG GTGIEEFFDA TTVGDQFALF