Gene Mmcs_3011 details


Gene Information

Locus tag: Mmcs_3011
Symbol:
ID: 4111843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 3181186
End bp: 3183948
Gene Length: 2763 bp
Protein Length: 920 aa
Translation table: 11
GC content: 68%
IMG OID: 638032140
Product: DNA polymerase I
Protein accession: YP_640174
Protein GI: 108799977
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
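
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3181186, 3183948)
print(length)                              # 2763
print(expected_protein_length_aa(length))  # 920
```

Both values agree with the Gene Length and Protein Length fields of this record.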


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.128275
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCGGCGG GGGTGCATAG AGTTGAGCCC GTGAGCCCAG CCAGCACCTC GACCGACAAG 
AAGACGGCCA CCAGCCAGAA GGCGGACGAC AAGCCGACAC TGATGCTGCT GGACGGCAAC
TCGCTGGCGT ACCGCGCCTT CTACGCACTC CCCGCCGAGA ACTTCAAGAC CCAGGGCGGC
CTGACCACCA ACGCGGTCTA CGGATTCACC GCGATGCTGA TCAACCTGCT CCGCGACGAG
CAGCCCTCCC ATGTGGCCGC CGCGTTCGAC GTCTCCCGGC AGACGTTCCG CGTCGACAAG
TACCCGGAGT ACAAGGCCGG CCGCTCGTCG ACTCCGGACG AGTTCCGCGG GCAGATCGAC
ATCGCCAAGG AGGTGCTCGT CGCGCTTGGC ATCGCCGTGC TGGCCGAACC CGGCTTCGAG
GCCGACGACA TCATCGCCAC GCTCGCCACC CAGGCCGAGG GTGAGGGTTA CCGGGTGCTC
GTCGTGACCG GTGACCGCGA CGCGCTCCAG CTCGTCAGCG ACGACGTGAC GGTGCTCTAT
CCCCGTAAGG GCGTCAGCGA CCTGACCCGC TTCACGCCCG ACGCGGTGCA GGAGAAGTAC
GGGCTGACGC CGCAGCAGTA CCCGGACTTC GCCGCGCTGC GGGGCGATCC GAGCGACAAC
CTGCCCGGCA TCCCGGGCGT GGGGGAGAAG ACGGCGACCA AGTGGATCGC CGAGTACGGG
TCGCTGCAGG CCCTGGTGGA CAACGTCGAC CAGGTCAAGG GCAAGGTCGG CGATGCTCTG
CGGGCCAACC TGTCCCACGT GGTGCTCAAC CGCGAACTCA CCGACCTCGT CAAGGACGTC
CCGCTTCCGC ACACTCCCGA CACCCTGCGG GTGCAGCCGT GGGACCGCGA TCAGCTCCAC
CGGCTGTTCG ACGACCTCGA GTTCCGGGTG CTGCGCGACC GCCTGTTCGA GACTCTGGCG
GCCGTCGAAC CCGAGGTCGA ACACGGCTTC GACGTGCGGG GCAAGGCGCT CGAACGGGGT
GAACTCGCGG CCTGGCTGTC CGAACACAGC CTCGGCAACC GGTTCGGGCT CGCGGTCGTC
GGCACCCATC TCGCCTACGA CGCCGACGCC ACCGCGCTGG CCATCGTCGC CGCCGACGGC
GACGGTCGCT ACATCGACAC GACCTCGCTC GACCCGGACG ACGAGGCGGC GCTGGCGTCG
TGGCTGGCCG ACCCGGGCCC GCCGAAGGCG CTGCACGAAG CCAAGCTCGC CATGCACGAC
CTCGCCGGTC GCGGCTGGAA GCTCGCCGGG GTCACCTCCG ACACGGCGCT GGCCGCCTAC
CTGGTGCGGC CCGGGCAGCG CAGCTTCAGC CTCGACGATC TGTCGCTGCG CTACCTGCGT
CGGGAACTGC GCGCGGACAA CCCTGCGCAA CAACAACTCT CACTGCTCGA CGACAGTGAC
GGCGGCGACG ACCAAGCCGT CCAGACGCTG ATCCTGCGGG CCGTGGCGGT GCTCGACCTC
GCCGACGCGC TCGACGAGGA ACTCGCCCGC ATCGACTCGT CGTCGCTACT GGGCCGGATG
GAGTTGCCGG TGCAGCGGGT GCTCGCCGAG ATGGAGCACA CCGGTATCGC CGTCGACATC
GATCAACTGC GGCAGTTGCA GAGCGAGTTC GCCGACCTGA TCCGCGATGC CGCCGAGGCG
GCCTACGCGG TGATCGGCAA GCAGATCAAC CTCGGCTCGC CCAAACAGCT GCAGGCGGTG
CTGTTCGACG AACTCGAGAT GCCCAAGACT AAGCGGACCA AGACCGGATA CACCACGGAT
GCCGATGCAC TGCAGTCGTT GTTCGACAAG ACCGGTCATC CTTTCCTGCA GCATCTGCTG
GCCCATCGCG ACGCCACCCG GCTCAAGGTG ACCGTCGACG GCCTCTTGAA CTCCGTCGCC
TCCGACGGCC GTATCCACAC GACGTTCAAC CAGACGATCG CCGCGACCGG TCGGCTGTCG
TCGACCGAAC CGAATCTGCA GAACATCCCG ATCCGCACCG AGGCGGGCCG GCGTATCCGC
GACGCCTTCG TGGTCGGAGA CGGATACACC GAAGTCATGA CCGCCGACTA CAGCCAGATC
GAGATGCGCA TCATGGCGCA CCTGTCACAG GACGCCGGTC TGATCGAGGC GTTCAACACC
GGAGAGGACC TGCACTCGTT CGTCGCCTCG CGCGCGTTCG ACGTGCCGAT CGACGAGGTG
ACCCCGGACC TGCGCCGCCG GGTCAAGGCG ATGTCGTACG GGTTGGCCTA CGGGCTGAGC
GCGTACGGAT TGGCCGCGCA GCTCAAGATC TCCACCGAAG AGGCCAAGGT CCAGATGGAC
CAGTACTTCG CCCGTTTCGG CGGTATCCGC GATTACCTGC GCGACGTCGT CGACCAGGCC
CGCAAGGACG GCTACACCTC GACGGTGTTC GGTCGCAGGC GGTATCTGCC CGAACTCGAC
AGCAGCAACC GCAACGTCCG CGAGGCCGCG GAGCGGGCCG CGCTCAACGC CCCGATCCAG
GGCAGCGCCG CGGACATCAT CAAGGTCGCG ATGATCGACG TCGATCAGGC GATCAAGGAC
GCCGGCCTCT CGTCACGGAT GCTGCTGCAG GTGCACGACG AGTTGCTGTT CGAGGTCGCC
GACGGCGAGC GCGACACGCT CGAGAAGCTG GTCCGCGACA AGATGGGCAG CGCGTACGCC
CTCGACGTTC CGCTCGAGGT CAGTGTCGGG TACGGCCGGA GCTGGGACGC GGCGGCGCAC
TGA
 
Protein sequence
MSAGVHRVEP VSPASTSTDK KTATSQKADD KPTLMLLDGN SLAYRAFYAL PAENFKTQGG 
LTTNAVYGFT AMLINLLRDE QPSHVAAAFD VSRQTFRVDK YPEYKAGRSS TPDEFRGQID
IAKEVLVALG IAVLAEPGFE ADDIIATLAT QAEGEGYRVL VVTGDRDALQ LVSDDVTVLY
PRKGVSDLTR FTPDAVQEKY GLTPQQYPDF AALRGDPSDN LPGIPGVGEK TATKWIAEYG
SLQALVDNVD QVKGKVGDAL RANLSHVVLN RELTDLVKDV PLPHTPDTLR VQPWDRDQLH
RLFDDLEFRV LRDRLFETLA AVEPEVEHGF DVRGKALERG ELAAWLSEHS LGNRFGLAVV
GTHLAYDADA TALAIVAADG DGRYIDTTSL DPDDEAALAS WLADPGPPKA LHEAKLAMHD
LAGRGWKLAG VTSDTALAAY LVRPGQRSFS LDDLSLRYLR RELRADNPAQ QQLSLLDDSD
GGDDQAVQTL ILRAVAVLDL ADALDEELAR IDSSSLLGRM ELPVQRVLAE MEHTGIAVDI
DQLRQLQSEF ADLIRDAAEA AYAVIGKQIN LGSPKQLQAV LFDELEMPKT KRTKTGYTTD
ADALQSLFDK TGHPFLQHLL AHRDATRLKV TVDGLLNSVA SDGRIHTTFN QTIAATGRLS
STEPNLQNIP IRTEAGRRIR DAFVVGDGYT EVMTADYSQI EMRIMAHLSQ DAGLIEAFNT
GEDLHSFVAS RAFDVPIDEV TPDLRRRVKA MSYGLAYGLS AYGLAAQLKI STEEAKVQMD
QYFARFGGIR DYLRDVVDQA RKDGYTSTVF GRRRYLPELD SSNRNVREAA ERAALNAPIQ
GSAADIIKVA MIDVDQAIKD AGLSSRMLLQ VHDELLFEVA DGERDTLEKL VRDKMGSAYA
LDVPLEVSVG YGRSWDAAAH
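
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch of that behaviour, not the pipeline used to produce this record. The codon table string and start-codon set are transcribed from the NCBI definition of table 11, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# NCBI genetic code table 11 (bacterial, archaeal and plant plastid).
# Codons are enumerated in the NCBI order, with bases cycling as T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if index == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above (60 nt, 20 codons).
first_line = "GTGTCGGCGG GGGTGCATAG AGTTGAGCCC GTGAGCCCAG CCAGCACCTC GACCGACAAG"
print(translate_cds(first_line))  # MSAGVHRVEPVSPASTSTDK
```

The output matches the first 20 residues of the protein sequence listed in this record.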