Gene Mflv_3564 details


Gene Information

Locus tag: Mflv_3564
Symbol:
ID: 4974882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 3800753
End bp: 3803479
Gene Length: 2727 bp
Protein Length: 908 aa
Translation table: 11
GC content: 67%
IMG OID: 640457789
Product: DNA polymerase I
Protein accession: YP_001134826
Protein GI: 145224148
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
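
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(3800753, 3803479)
print(length, protein_length_aa(length))
```

Under those assumptions the computed values agree with the Gene Length (2727 bp) and Protein Length (908 aa) fields.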


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0954216
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.139495
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCCAG CCAAGACCGC CTCGAAGAAG GCCGCCACAC CGGCCGCCGA CGACACGCCG 
ACGCTGATGC TGCTCGACGG CAACTCGCTG GCCTTCCGGG CGTTCTACGC GCTGCCTGCG
GAGAACTTCA AGACCCAGGG CGGGCTGACC ACCAACGCCG TGTACGGCTT CACCGCGATG
CTCATCAACC TCATCCGCGA CGAGCAGCCC AGCCACATCG CGGCCGCGTT CGATGTCTCC
CGCCAGACCT TCCGCAAGGA GAAGTACCCC GAGTACAAGG AAGGCCGGTC GGCGACGCCC
GACGAGTTCC GCGGCCAGAT CGACATCACC AAAGAGGTCC TCGGTGCGCT CGGCATCACG
GTGCTCGCCG AGGCCGGCTT CGAGGCCGAC GACATCATCG CCACGCTGGC GACGCAGGCC
GAGGGCGAGG GGCACCGCGT CCTGGTGGTC ACCGGTGATC GCGACGCGCT GCAACTGGTC
AGCGACGACG TCACCGTGCT TTACCCCCGC AAGGGAGTCA GCGAGCTGAC CCGCTTCACG
CCGGAGGCGG TTCAGGAGAA GTACGGGCTC ACGCCCGCGC AGTACCCGGA TTTCGCCGCG
CTGCGCGGCG ACCCCAGCGA CAACCTGCCG GGCATCCCGG GCGTGGGGGA GAAGACCGCC
ACCAAGTGGA TCGCCGAATA CGGGTCGTTG CAGGGGCTGG TCGACAACGT CGACAAGGTC
AAGGGCAAGG TCGGCGATGC GTTGCGCGCG CACCTGTCCT CGGTGGTGCT CAACCGTGAG
CTCACCGAGC TGGTGAAGGA CGTGCCGCTC GCCCAGACCC CGGACACGCT ACGCATGCAG
CCCTGGGATC GCGACCACAT CCACCGGCTC TTCGACGATC TCGAGTTCCG TGTCCTGCGT
GACCGGCTCT TCGACACCCT CGCCTCCGCC GATCCGGAGG TCGAGGAAGG CTTCGACGTC
CAGGGCGAGG CGCTGCAGGC CGGGACGCTG GCGCCGTGGC TGGCCGAGAA CAGCGACGGT
AGACGGTTCG GGCTGGCCGT CGTCGGCAAC CATCTCGCCT TCGACAGCGA CGCGACCGCG
CTGACCATCG TGGCCTCCGG CGGCGAGGGC CGCTACATCG ACACCACCGG GCTCGATCCC
GACGACGAGA AGGCGCTGGC CTCCTGGCTG GCCGACCCGC ACGTGCCCAA GGCGCTGCAT
GAGGCCAAGC TTGCGATCCA CGATCTGCAG GGCAGGGGCT GGACGCTGGC CGGCGTCACC
TCCGACACCG CGCTGGCCGC GTACCTGGTC CGCCCGGGAC AGCGCAGCTT CGCTCTCGAC
GACCTGTCGC TGCGCTACCT CAAGCGTGAA CTGCGCGCCG ACAACCCTGA GCAGCAACAA
CTTTCGCTCC TCGATGACAG TGACGGCGTC GACGACCAGG CCGTGCAGAC GCTGCTGTTG
CGTGCCAGCG CGGTGGTCGA CCTCGCCGAC GCCCTCGACG AGGAACTCGC GCGCATCGAC
TCCTCGGCGT TGCTGGGCAA CATGGAACTC CCGGTGCAGC GGGTGCTGGC AGAGCTCGAA
ACCGCAGGCA TCGCCGTCGA TCTGGAGATG CTCTCCGGCC TGCAGAGCGA GTTCGCGGAC
CAGATCCGCG ACGCCGCCGA GGCGGCCTAC GCCGTGATCG GCAAGCAGAT CAACCTCGGC
TCCCCGAAAC AGCTGCAGGT GGTCCTGTTC GACGAGCTCG AGATGCCGAA GACCAAACGC
ACCAAGACCG GCTACACCAC CGACGCGGAT GCGCTGCAGA GTCTCTTCGA CAAGACCGGA
CATCCGTTCC TGCAGCATCT GCTGGCCCAC CGCGACGCGA CGCGGCTGAA GGTGACAGTC
GACGGCCTGC TCAACGCGGT GGCCTCCGAC GGGCGGATCC ATACGACGTT CAACCAGACG
ATCGCGGCAA CGGGCCGGTT GTCGTCCACC GAGCCGAACC TGCAGAACAT CCCGATCCGC
ACCGAGGCCG GCCGCCGTAT CCGCGACGCG TTCGTCGTGG GTGACGGTTA CGGCGAGTTG
ATGACCGCCG ATTACAGCCA GATCGAGATG CGCATCATGG CGCACCTGTC CCGTGACGAG
GGCCTGATCG AGGCGTTCAA CACCGGCGAG GATCTGCACT CGTTCGTCGC GTCGCGGGCC
TTCTCGGTGC CGATCGACGA GGTCACCGCC GAACTGCGGC GGCGGGTCAA GGCCATGTCC
TACGGCCTGG CCTACGGGCT CAGCGCCTAC GGGTTGTCCC AGCAGCTCAA GATCTCGACC
GAAGAGGCCA AAGAGCAGAT GGAACAGTAC TTCGCGCGAT TCGGCGGGGT GCGCGACTAT
CTGCGCGACG TCGTCGACCA GGCCCGTAAA GACGGTTACA CGTCGACGGT GTTCGGTCGC
AGGCGTTACC TCCCCGAACT CGACAGCAGC AACCGGCAGG TGCGGGAGGC CGCCGAACGG
GCGGCCCTCA ACGCGCCGAT CCAGGGCAGT GCCGCCGACA TCATCAAGGT CGCGATGATC
AACGTCGACC AGGCGATCAA GGACGCGGGG CTGAAGTCCC GCACGCTGCT GCAGGTGCAC
GACGAACTCC TCTTCGAAGT CGCCGACGGC GAGCGTGACA CCCTCGACGC CCTGGTACGC
GAGCATATGG GCTCCGCATA CGCCCTCGAC GTGCCGTTGG AGGTGTCTGT CGGCTTCGGG
CGCAGTTGGG ACGCGGCCGC GCACTGA
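
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before any calculation. As a minimal sketch, the hypothetical helper below computes the G+C fraction of a block-formatted sequence; it is shown on the first row only, and the full sequence would need to be pasted in to compare against the GC content field of the record.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring all whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above, copied with its block spacing.
first_row = (
    "GTGAGCCCAG CCAAGACCGC CTCGAAGAAG "
    "GCCGCCACAC CGGCCGCCGA CGACACGCCG"
)
print(f"GC fraction of first row: {gc_fraction(first_row):.2f}")
```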
 
Protein sequence
MSPAKTASKK AATPAADDTP TLMLLDGNSL AFRAFYALPA ENFKTQGGLT TNAVYGFTAM 
LINLIRDEQP SHIAAAFDVS RQTFRKEKYP EYKEGRSATP DEFRGQIDIT KEVLGALGIT
VLAEAGFEAD DIIATLATQA EGEGHRVLVV TGDRDALQLV SDDVTVLYPR KGVSELTRFT
PEAVQEKYGL TPAQYPDFAA LRGDPSDNLP GIPGVGEKTA TKWIAEYGSL QGLVDNVDKV
KGKVGDALRA HLSSVVLNRE LTELVKDVPL AQTPDTLRMQ PWDRDHIHRL FDDLEFRVLR
DRLFDTLASA DPEVEEGFDV QGEALQAGTL APWLAENSDG RRFGLAVVGN HLAFDSDATA
LTIVASGGEG RYIDTTGLDP DDEKALASWL ADPHVPKALH EAKLAIHDLQ GRGWTLAGVT
SDTALAAYLV RPGQRSFALD DLSLRYLKRE LRADNPEQQQ LSLLDDSDGV DDQAVQTLLL
RASAVVDLAD ALDEELARID SSALLGNMEL PVQRVLAELE TAGIAVDLEM LSGLQSEFAD
QIRDAAEAAY AVIGKQINLG SPKQLQVVLF DELEMPKTKR TKTGYTTDAD ALQSLFDKTG
HPFLQHLLAH RDATRLKVTV DGLLNAVASD GRIHTTFNQT IAATGRLSST EPNLQNIPIR
TEAGRRIRDA FVVGDGYGEL MTADYSQIEM RIMAHLSRDE GLIEAFNTGE DLHSFVASRA
FSVPIDEVTA ELRRRVKAMS YGLAYGLSAY GLSQQLKIST EEAKEQMEQY FARFGGVRDY
LRDVVDQARK DGYTSTVFGR RRYLPELDSS NRQVREAAER AALNAPIQGS AADIIKVAMI
NVDQAIKDAG LKSRTLLQVH DELLFEVADG ERDTLDALVR EHMGSAYALD VPLEVSVGFG
RSWDAAAH
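
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the database's own method: it builds the standard codon assignments (which translation table 11 shares) and, as a simplifying assumption, reads an initial GTG or TTG as methionine, since table 11 permits alternative start codons. The names `CODON_TABLE`, `ALT_STARTS`, and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only these alternative starts are handled in this sketch.
ALT_STARTS = {"GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace is ignored; characters other than A, C, G, T raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in ALT_STARTS:
            residue = "M"  # alternative start codon read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCCCAG CCAAGACCGC CTCGAAGAAG"))  # MSPAKTASKK
```

The first ten residues produced this way match the start of the protein sequence listed above, which begins with M even though the gene starts with GTG.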