Gene Mflv_4893 details


Gene Information

Locus tag: Mflv_4893
Symbol: dnaE2
ID: 4976204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 5212620
End bp: 5215883
Gene Length: 3264 bp
Protein Length: 1087 aa
Translation table: 11
GC content: 69%
IMG OID: 640459120
Product: error-prone DNA polymerase
Protein accession: YP_001136147
Protein GI: 145225469
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
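
The coordinates and lengths in this record are mutually consistent, and the arithmetic can be sketched as below. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names `span_length` and `coding_length_aa` are hypothetical helpers introduced only for this example.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5212620
END_BP = 5215883
GENE_LENGTH_BP = 3264
PROTEIN_LENGTH_AA = 1087


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Each line prints whether the computed value matches the listed value.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the listed values, 5215883 - 5212620 + 1 gives 3264 bp, and 3264 / 3 - 1 gives 1087 aa, matching the Gene Length and Protein Length fields.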


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0740584
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCGCG TCCTCGAAGG CAAGCCCCGT CGCGCCGGTT GGCCCATCGA CGCACAGGTC 
GGCGACGGCG GGGACAGTCC AGCCTGGTCG AGCAAGCGGG GGCAGTACCG GGCGCCGGAG
TCGAGGGGCC CCGCGACGGG GCGTTCGATG CCCTACGCCG AGCTACACGC CCACTCGGCC
TACAGCTTTC TCGATGGTGC GAGCACTCCC GAGGAACTGG TGGAGGAGGC GTCCCGGCTG
GGCCTGCGGG CCCTCGCGCT GACCGACCAC GACGGGCTCT ACGGTGTGGT GCGCTTCGCC
GAAGCGGCCA AGGAGCTCGA CGTGGACACG GTCTTCGGTG CAGAACTGTC CCTCGGTGGG
GGAACCCGCA CCGACGTCCC CGACCCGCCC GGCCCGCACC TGCTGGTGCT CGCCCGCGGG
CCGGAGGGCT ACCGGCGACT GTCGCGGCAG CTGGCTGCGG CGCATCTGGC CGGTGGCGAG
AAAGGGGTGC TGCGCTATGA CTTCGACGCG TTGACGGAGG CCGCGGGCGG GCACTGGCAG
ATCCTCACCG GATGTCGCAA GGGTCATGTC CGCCAGGCGC TGTTGCGTGG AGGCGACAGT
GCTGCCGAGG CCGCGCTGGC CGATCTGGTG GATCGGTTCG GCCGCGAGCG GGTCACTGTC
GAGCTCACCC ACCACGGCCA TCCGCTCGAC GACGAACGCA ATGCCGCACT GGCCGCGCTG
GCGCCGCGGT TCGGCCTCAC CGTCATCGCC ACCACCGCCG CGCATTTCGC CGAACCGTCC
CGGGGCAGGC TCGCGATGGC GATGGGGGCG ATCCGGGCCC GGAACTCGAT CGACGAAGCA
GCGGGTTACC TTGCCCCGCT CGGCGGGTCG CATCTGCGGT CGGGGGAGGA GATGGCACGG
GTGTTCGCCC ACTGCCCCGA GGTGGTGACG GCCGCCGCCG ATCTCGGTGA ACAGTGCGCG
TTCGGCCTTG CGCTGATCGC TCCACAGCTG CCGCCGTTCG AGGTCCCGGC GGGGCACACC
GAGAACAGCT GGTTACGGCA CCTGGTGATG CAGGGCGCCC GCGAACGCTA CGGCCCGCCC
GAGCGCGCGT CGCGCGCCTA CGCCCAGATC GAGCACGAGC TGCGGGTCAT CGAGCAATTG
AATTTTCCGG GTTATTTCCT TGTGGTGCAC GACATCACCC GGTTCTGCCG GGAGAACGCG
ATCCTGTCCC AGGGTCGGGG GTCGGCGGCC AACTCGGCGG TCTGCTACGC CCTCAAGGTC
ACCAACGTCG ACCCGATCGC CAACGGGTTG TTGTTCGAAC GCTTCCTGTC CCCGGCCCGC
GACGGGCCAC CCGACATCGA CATCGACATC GAATCCGACC TGCGCGAGAA GGCGATCCAG
TACGTCTACG AACGCTATGG GCGCGAGTAC GCCGCCCAGG TGGCCAACGT GATCACCTAC
CGGGGACGCA GCGCGGTGCG CGACATGGCC CGCGCGCTGG GATTCTCGCA GGGGCAGCAG
GATGCCTGGA GCAAGCAGAT CAGCCAGTGG GGCAACCTGG CCGACGCCAC CCATGTCGAG
GACATCCCGG AGCCCGTGAT CGACCTGGCC ATGCAGATCT CCCACCTGCC CCGGCACATG
GGAATCCACT CCGGCGGCAT GGTGATCTGC GACCGCCCGA TCGCCGACGT GTGTCCGGTC
GAGTGGGCGC GGATGGAGAA CCGCAGCGTC CTGCAGTGGG ACAAAGACGA CTGTGCCGCA
ATCGGTTTGG TCAAGTTCGA TCTGCTGGGG CTCGGTATGC TCTCGGCACT GCACTACGCG
ATCGACCTGG TGGCCGAGCA CAAAGGCCTT GAGGTGGACC TGGCCAAGCT CGACCTCTCG
GAACCGGCGG TGTACGAGAT GCTGCAGCGG GCCGACTCCG TCGGGGTGTT CCAGGTGGAA
TCCCGGGCGC AGATGGCGAC CCTGCCCCGG CTCAAGCCCC GGATGTTCTA CGACCTGGTG
GTCGAGGTCG CGCTGATCCG GCCCGGCCCC ATCCAGGGTG GCTCGGTGCA TCCCTACATC
AAGCGGCGCA ACGGTCAGGA GCCGGTCACC TACGAGCATC CCTCGATGGA GCGGGCACTG
CGAAAAACGT TGGGGGTGCC GCTGTTCCAG GAGCAGTTGA TGCAGCTGGC GGTCGACTGC
GCGGGCTTCT CGGCGGCCGA GGCCGACCAG CTGCGCAGGG CGATGGGCTC CAAGCGCTCG
ACGGAGAAGA TGCGCCGGCT GCGCGGACGG TTCTTCGAGG GCATGGCCGA ACTGCACGGC
ATCAGCGGAG ACGTCGCGCG CAGGATCTAC GAGAAGCTGG AGGCCTTCGC GAATTTCGGC
TTCCCCGAGA GTCATTCGCT GAGCTTCGCG TCGCTGGTGT TCTACTCCTC GTGGTTCAAG
TTGCACCATC CCGCGGCGTT CTGCGCCGCA CTGCTGCGCG CGCAGCCGAT GGGCTTCTAC
TCACCGCAGA CGCTGGTCGC CGACGCCCGC AGGCACGGCG TCGACGTGCA CGGGCCGGAC
GTGAACGCCA GCCTCGCGCA CGCCACGCTG GAGAACCACG GACTCGACGT GCGACTCGGT
CTCGGCAGCA TCCGCCACAT CGGCGACGAG CTGGCGCAAC GTCTGGTGGA GGACCGAAAG
CTCAACGGCC CCTTCGTTTC TCTGACCGAT CTGACCAGGC GGGTGCAGCT GACGGTCCCG
CAGACCGAGG CGCTGGCGAC CGCGGGTGCA CTGGGATGCT TCGGGATCAC CCGGCGGGAG
GGTCTGTGGG CGGCGGGCGC GGCTGCCACC GAACGGCCGG ATCGGCTCCC GGGGGTGGGT
TCGTCCTCGC AGGTGCCGTC CCTGCCCGGC ATGACGGAAC TGGAGCTGAC GGCCGCGGAC
GTGTGGGCGA CCGGGGTGTC CCCGGACCGC TACCCGACGG AGTTCCTGCG TGAAGACCTC
GACGCGATGG GCGTGGTGCC CGCCGACCGG CTGTTGTCGG TGCGCGACGG CACCCGGGTG
CTGGTGGCGG GGGCGGTGAC GCACCGGCAG CGGCCCGCCA CCGCACAGGG GGTGACGTTC
CTGAACCTCG AAGACGAGAC CGGTATGGTC AATGTGCTCT GCTCCCAAGG CATCTGGGCC
CGGCACCGCA AACTGGCGCA GACCGCGTCG GCGCTGGTGG TCCGCGGCAT CGTGCAGAAC
GCCACCGGGG CCGTCACGGT GGTCGCCGAC AGGATGGGCC CGATCAACAT GAAGGTCGCG
TCGAAGTCGC GCGACTTCCG CTGA
 
Protein sequence
MRRVLEGKPR RAGWPIDAQV GDGGDSPAWS SKRGQYRAPE SRGPATGRSM PYAELHAHSA 
YSFLDGASTP EELVEEASRL GLRALALTDH DGLYGVVRFA EAAKELDVDT VFGAELSLGG
GTRTDVPDPP GPHLLVLARG PEGYRRLSRQ LAAAHLAGGE KGVLRYDFDA LTEAAGGHWQ
ILTGCRKGHV RQALLRGGDS AAEAALADLV DRFGRERVTV ELTHHGHPLD DERNAALAAL
APRFGLTVIA TTAAHFAEPS RGRLAMAMGA IRARNSIDEA AGYLAPLGGS HLRSGEEMAR
VFAHCPEVVT AAADLGEQCA FGLALIAPQL PPFEVPAGHT ENSWLRHLVM QGARERYGPP
ERASRAYAQI EHELRVIEQL NFPGYFLVVH DITRFCRENA ILSQGRGSAA NSAVCYALKV
TNVDPIANGL LFERFLSPAR DGPPDIDIDI ESDLREKAIQ YVYERYGREY AAQVANVITY
RGRSAVRDMA RALGFSQGQQ DAWSKQISQW GNLADATHVE DIPEPVIDLA MQISHLPRHM
GIHSGGMVIC DRPIADVCPV EWARMENRSV LQWDKDDCAA IGLVKFDLLG LGMLSALHYA
IDLVAEHKGL EVDLAKLDLS EPAVYEMLQR ADSVGVFQVE SRAQMATLPR LKPRMFYDLV
VEVALIRPGP IQGGSVHPYI KRRNGQEPVT YEHPSMERAL RKTLGVPLFQ EQLMQLAVDC
AGFSAAEADQ LRRAMGSKRS TEKMRRLRGR FFEGMAELHG ISGDVARRIY EKLEAFANFG
FPESHSLSFA SLVFYSSWFK LHHPAAFCAA LLRAQPMGFY SPQTLVADAR RHGVDVHGPD
VNASLAHATL ENHGLDVRLG LGSIRHIGDE LAQRLVEDRK LNGPFVSLTD LTRRVQLTVP
QTEALATAGA LGCFGITRRE GLWAAGAAAT ERPDRLPGVG SSSQVPSLPG MTELELTAAD
VWATGVSPDR YPTEFLREDL DAMGVVPADR LLSVRDGTRV LVAGAVTHRQ RPATAQGVTF
LNLEDETGMV NVLCSQGIWA RHRKLAQTAS ALVVRGIVQN ATGAVTVVAD RMGPINMKVA
SKSRDFR
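
Both sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction that could be compared against the GC content field. It is an illustration only: `join_blocks` and `gc_fraction` are hypothetical helper names, and the demo input is just the first line of the gene sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence above, used only as a short demo input.
    first_line = (
        "ATGCGTCGCG TCCTCGAAGG CAAGCCCCGT CGCGCCGGTT GGCCCATCGA CGCACAGGTC"
    )
    dna = join_blocks(first_line)
    print(len(dna))
    print(round(gc_fraction(dna), 2))
```

Running the same two functions over the complete gene sequence would give a length and GC fraction to check against the Gene Length and GC content fields.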