Gene Mkms_3351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3351 
Symbol 
ID4611277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3513740 
End bp3515632 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content74% 
IMG OID639793024 
Producthypothetical protein 
Protein accessionYP_939335 
Protein GI119869383 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex
[COG2176] DNA polymerase III, alpha subunit (gram-positive type) 
TIGRFAM ID[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.477195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0699162 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCAGG GCAGCGTCGG TGCCGTCGGG GGCGCGGCCG AGCAGCTGTC GTTCGACATG 
GAGGCGCTGT CGCTGCGCGA CACCACGTTC GTCGTCGTGG ACCTGGAGAC CACCGGCGGT
CGCGCCACGG GTGAGCGTCC GGATGCGATC ACCGAGATCG GCGCGGTCAA GGTCCGCGGC
GGTGAGGTGC TCGGTGAGCT CGCCACCCTG GTCGACCCCG GGCGGGCGAT ACCGCCGCAG
ATCGTCTCGC TGACCGGCAT CACCACCGCG ATGGTGTGCG CCGCCCCCCG CATCGAATCG
GTGCTGCCCG CATTCCTCGA GTTCGCGCGC GGCTCGGTGC TGGTGGCCCA CAACGCCGGC
TTCGACATCG GCTTCCTGCG GGCGGCGGCC GAACAGTGCG CGCTGACCTG GCCCCGCCCG
CCGGTGCTGT GCACGGTCAA GCTCGCGCGC CGCGTGCTCA CCCGCGACGA GGCGCCCAGC
GTGCGGCTGT CAGCGCTGGC GCAGTTGTTC CGCGCGAAGA CGACGCCGAC GCACCGGGCC
CTCGACGATG CCCGCGCCAC GGTGGACGTA CTGCACGGGC TGATCGAACG GATCGGCAAC
CAGGGCGTGC ACACCTACAC CGACCTGCGC GCCTACCTGC CCGACGTCAC CCCCGCGCAG
CGCCGCAACC GCCGCCTCGC CGACGGTCTG CCCCACCGGC CGGGGGTGTA CCTGTTCCGC
GGCCCGGGCG ACGAGGTGCT CTACATCGGC ACCGCGGTGG ACCTGCGCCG CCGCGTCGGC
CAGTACTTCA CCGGAGCCGA CCCGCGGGCG CGGATGAAGG AGATGGCGTC CCTGGCCACC
CGCGTGGACC ACGTGGAATG CGCCCACGAA CTCGAGGCGG GCGTGCGTGA GCTGCGCCTG
CTGGCCGCCC ATGCGCCGCC CTACAACCGG CGTTCGAAGT TCCCGCAGCG CTGGTGGTGG
GTGGTGCTGA CCGACGAGCC CTTCCCGCGG TTCTCGGTCG TGCGCGCACC CCGTCACGGT
TCGGCGGTCG GGCCGTTCCG GGCCCGCACG GACGCCGTGC AGACCGCTGA ACTGCTCGCC
CGGTTCACCG GTGTGCGGAC CTGCACCGCC CGGCTCGCCC GCGCGGCCCG GCACGGCGCG
GCCTGCGCCG AGCGTGAACT GTCACCGTGC CCGGCGCCGC GCGACATCGA CGCGGCGGCC
TACGCCCCGG CCCACCGCCG CGCCGCCGAC CTCATCGAGG GCCGCGACGA TGCGGCGCTG
GCCGCGGTGG TCGACGGGAT CGCCGCGCTG GCGGCCGTCA ACCGGTACGA ATCGGCCGCG
CGCCTGCGTG ACCACGCCGC CACGGGCATC GACGTGCTGT GGCGGGGGCA GCGACTGCGT
GCGCTCGCCG ACCAGACCGA GTTGGTCGCG GCCCGTCCGG ACGGCAGCGG TGGATGGGAC
CTCGCCGTCG TGCGGTACGG ACGCCTGGCC GCCGCCGGGT GCGCCCGTCG CGGGGTGCCG
CCGATGCCGG TCGTCGACGC GCTGACCGCC GCGGCGCAGA CGGTGCTGCC CGATCCCGCC
CCGCTTGGCG GTGCGCTGGT CGAGGAGACC GGGCTGATCA CCCGCTGGCT CACCAGCCCC
GGTGTGCGGA TCGTGCGGTG CGAACCCGGG TACGCCACAC CGATCGGCGC GGCGGGCCGC
TGGCTGGGCT GGGCGGATAC GGCGCGTTCG GCACGGTTGG CCGCCGAGCA GACCGGTGCG
GACGCGAGGA CCAGAGAGTC GGTCCCCTCA GAGCTTCTGG GTGAACCGCA CCCAACGCGC
GAGCAGCTTT TCGGCCGCCC CGGAGTCGAT GGTCTCGGTC GCCCGGGCCA GGCCCGCCTC
CCAGGCCGGC ACCCATTTGG CGTCGCTGGA TAG
 
Protein sequence
MGQGSVGAVG GAAEQLSFDM EALSLRDTTF VVVDLETTGG RATGERPDAI TEIGAVKVRG 
GEVLGELATL VDPGRAIPPQ IVSLTGITTA MVCAAPRIES VLPAFLEFAR GSVLVAHNAG
FDIGFLRAAA EQCALTWPRP PVLCTVKLAR RVLTRDEAPS VRLSALAQLF RAKTTPTHRA
LDDARATVDV LHGLIERIGN QGVHTYTDLR AYLPDVTPAQ RRNRRLADGL PHRPGVYLFR
GPGDEVLYIG TAVDLRRRVG QYFTGADPRA RMKEMASLAT RVDHVECAHE LEAGVRELRL
LAAHAPPYNR RSKFPQRWWW VVLTDEPFPR FSVVRAPRHG SAVGPFRART DAVQTAELLA
RFTGVRTCTA RLARAARHGA ACAERELSPC PAPRDIDAAA YAPAHRRAAD LIEGRDDAAL
AAVVDGIAAL AAVNRYESAA RLRDHAATGI DVLWRGQRLR ALADQTELVA ARPDGSGGWD
LAVVRYGRLA AAGCARRGVP PMPVVDALTA AAQTVLPDPA PLGGALVEET GLITRWLTSP
GVRIVRCEPG YATPIGAAGR WLGWADTARS ARLAAEQTGA DARTRESVPS ELLGEPHPTR
EQLFGRPGVD GLGRPGQARL PGRHPFGVAG