Gene Mvan_1530 details

Gene Information

Locus tag: Mvan_1530
Symbol: dnaE2
ID: 4648263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1618327
End bp: 1621623
Gene Length: 3297 bp
Protein Length: 1098 aa
Translation table: 11
GC content: 70%
IMG OID: 639805027
Product: error-prone DNA polymerase
Protein accession: YP_952367
Protein GI: 120402538
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
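
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming 1-based inclusive coordinates and a final stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1618327, 1621623)
print(length_bp)                  # 3297
print(protein_length(length_bp))  # 1098
```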


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.94615
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTTGGC ATACCGGTCC GCCGAGCTGG ACCGAGATGG AGCGCGTGCT CTCCGGTAAG 
CCGCGTCGCG CCGGCTGGCC GATCGATCAG CAGATCGGCG ACGGCGGGGA CAGCCCCGCC
TGGTCGCGCA AACGCGGGGA GTATCACGCG CCGGAGGGGC CCGGCGCGCA GGAGTCGTCG
ACGCCGTATG CGGAGCTGCA CGCCCACTCG GCCTACAGCT TTCTGGACGG CGCCAGCACG
CCGGAGGAAC TCGTGGAGGA GGCCGCCCGG CTGAACCTGC GGGCCATCGC GCTGACCGAC
CACGACGGGC TCTACGGCGT GGTGCGGTTC GCCGAGGCGG CCAGGGAACT CGACGTGGCG
ACGGTGTTCG GCGCGGAATT GTCCCTGGGC GGGGGAACCC GCACCGACGT CCCGGACCCG
CCCGGGCCGC ACCTGCTGGT GCTGGCCCGC GGGCCGGAGG GCTACCGGCG GTTGTCGCGG
CAGCTGGCCG CGGCGCATCT GGCCGGTGGT GAGAAGGGGG TGCTGCGCTA CGACTTCGAC
GCACTGACCG AGGCCGCGGG CGGGCATTGG CAGATCCTCA CCGGATGCCG TAAAGGGCAT
GTCCGGCAAG CTCTTTCCAC TGGCGGCCCG GAGGCGGCCG AGGCCGCGCT GGCCGATCTG
GTCGACCGCT TCGGTCCGGA CCGGGTGACC GTCGAGCTCA CCCACCACGG CCATCCACTC
GACGACGAGC GCAACGCCGC GCTGGCCGCG CTGGCGCCGC GGTTCGGTCT GGGCGTCGTC
GCCACCACCG CCGCGCACTT CGCCGAACCG GCCAGGGGTC GGCTCGCGAT GGCGATGGGC
GCCATCCGGG CCCGCAACTC GATCGATGAA GCCGCGGGTT ACCTTGCGCC GCTGGGTGGT
TCGCATCTGC GCTCGGGCGA CGAGATGGCC CGGATGTTCG CCCACTGCCC CGAGGTGGTG
ACGGCCGCGG CCGAACTCGG CGAACAGTGC GCCTTCGGTC TCGCCCTGAT CGCACCGCAG
CTGCCGCCGT TCGACGTGCC CGACGGGCAC ACCGAGAGCA GCTGGCTGCG GCATCTGGTG
ATGCAGGGCG CCCGCGAGCG CTACGGCCCC CCGGAGCGGG CCTCCCGGGC GTACGCGCAG
ATCGAGCACG AGCTGGCGGT GATCGAGCAG CTGAACTTTC CCGGCTACTT CCTCGTGGTG
CACGACATCA CCAGGTTCTG CCGGGACAAC AACATCCTGT CCCAGGGCCG GGGGTCGGCG
GCCAACTCCG CGGTCTGCTA TGCGCTCAAG GTCACCAACG TCGATCCGAT CGCCAACGAC
CTGCTGTTCG AACGCTTCCT GTCCCCGGCC CGCGACGGGC CACCCGACAT CGACATCGAC
ATCGAATCCG ACCTGCGCGA GAACGCGATC CAGTACGTCT ACCAGCGTTA CGGGCGGGAG
TACGCCGCCC AGGTCGCCAA CGTGATCACC TACCGGGGGC GCAGCGCGGT GCGGGACATG
GCCCGTGCGC TCGGGTTCTC CCAGGGTCAG CAGGACGCCT GGAGCAAGCA GGTCAGCCAG
TGGGGGAATT TGGCGGACGC CACGCACGTC GAGGACATCC CCGGGCCGGT GGTCGACCTG
GCCAAGCAGA TCTCGAATCT GCCCCGGCAC ATGGGCATCC ATTCCGGCGG CATGGTGATC
TGCGACCGTC CGATCGCCGA CGTGTGCCCG GTCGAATGGG CGCGGATGGA GAACCGCAGC
GTCCTGCAAT GGGACAAGGA TGATTGCGCG GCAATCGGTT TGGTCAAGTT CGACCTGCTG
GGCCTCGGCA TGCTCTCAGC GCTGCACTAC GCCATCGACC TGGTCGCCGA GCACAAGGGC
ATCGAGGTCG ATCTGGCGAA GCTGGATCTG TCGGAACCCG CCGTGTACGA GATGTTGGCC
CGCGCGGACT CGGTCGGGGT GTTCCAGGTG GAGTCGCGCG CCCAGATGGC CACGCTGCCC
CGGCTCAAGC CGCGGATGTT CTACGACCTG GTGGTCGAGG TGGCGCTGAT CCGGCCGGGG
CCGATCCAGG GTGGGTCGGT ACACCCGTAC ATCAAGCGCC GCAACGGGCA GGAGGCGGTC
ACCTACGACC ATCCGTCGAT GGAGTCGGCG CTGCGCAAGA CGTTGGGGGT GCCGCTTTTT
CAGGAGCAGC TGATGCAGCT GGCCGTCGAC TGCGCCGGTT TCACCCCGGC CGAGGCCGAC
CAGTTGCGGC GCGCGATGGG CTCGAAACGC TCGACGGAGA AGATGCGCAG GCTGCGTGGG
CGCTTCTTCG ACGGCATGGC CGAACTGCAC GGCGTCACCG GCGACGTGGC GCAGCGGATC
TACGAGAAGC TGGAAGCGTT CGCCAACTTC GGCTTCCCGG AGAGTCATTC ACTGAGCTTC
GCGTCGCTGG TGTTCTATTC GTCGTGGTTC AAGCTGCACC ACCCGGCGGC GTTCTGCGCC
GCGCTGCTGC GGGCGCAACC GATGGGCTTC TACTCACCGC AGTCACTGGT CGCCGACGCC
CGCAGGCACG GCGTCGTCGT GCACGGACCC GACGTGAACG CCGGCCTGGC GCACGCCACG
CTGGAGAACC ACGGGCTGGA TGTGCGGCTG GGGCTGGGCG GCGTCCGTCA CATCGGCGAC
GAGCTCGCCG AACGCCTGGT CGGGGAGCGG AAAGCCCACG GGCCGTTCAC TTCTCTGACC
GATCTGACTC GACGGGTGCA GCTGTCGGTA CCCCAGACCG AGGCGCTGGC CACCGCGGGT
GCGCTGGGCT GTTTCGGGAT CACCCGGCGG GAGGGGCTGT GGGCCGCGGG AGCAGCCGCC
ACCGAACGGC CCGACCGGCT GCCCGGGGTC GGGTCGTCCT CCCATGTGCC GTCGCTGCCA
GGCATGACCG AGCTGGAGCT GACCGTCGCC GACGTGTGGG CCACCGGGGT GTCGCCGGAC
CGGTACCCGA CCGAGTTCCT GCGCGAGGAC CTGGACGCGA TGGGGGTGGT GCCCGCCGAT
CAGCTGTTGT CGCTGCCCGA CGGCACCCGG GTGCTGGTGG CCGGAGCGGT GACCCACCGG
CAGCGGCCCG CCACCGCGCA GGGGGTGACG TTCATGAACC TCGAAGACGA AACCGGTATG
GTCAATGTGT TGTGCTCACA AGGGGTTTGG GCACGTCACC GTAAGCTGGC GCAGACGGCG
TCGGCGCTGG TGGTGCGCGG CATCGTGCAG AACGCCACCG GGGCCGTCAC CGTGGTCGCC
GACCGGATGG GCAGGCTCAG TCTGCGGGCG GCGTCGAAGT CCCGCGACTT CCGCTAG
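
A GC content figure such as the one reported in the Gene Information section could be recomputed from the sequence as laid out here. The sketch below assumes that whitespace between the 10-base blocks is only formatting and that GC content is the percentage of G and C bases over the whole sequence. The helper name is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace and case are ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Only the first 10-base block of the gene, as a small demonstration;
# paste the full sequence to obtain the gene-wide value.
print(gc_content("GTGGGTTGGC"))  # 70.0
```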
 
Protein sequence
MGWHTGPPSW TEMERVLSGK PRRAGWPIDQ QIGDGGDSPA WSRKRGEYHA PEGPGAQESS 
TPYAELHAHS AYSFLDGAST PEELVEEAAR LNLRAIALTD HDGLYGVVRF AEAARELDVA
TVFGAELSLG GGTRTDVPDP PGPHLLVLAR GPEGYRRLSR QLAAAHLAGG EKGVLRYDFD
ALTEAAGGHW QILTGCRKGH VRQALSTGGP EAAEAALADL VDRFGPDRVT VELTHHGHPL
DDERNAALAA LAPRFGLGVV ATTAAHFAEP ARGRLAMAMG AIRARNSIDE AAGYLAPLGG
SHLRSGDEMA RMFAHCPEVV TAAAELGEQC AFGLALIAPQ LPPFDVPDGH TESSWLRHLV
MQGARERYGP PERASRAYAQ IEHELAVIEQ LNFPGYFLVV HDITRFCRDN NILSQGRGSA
ANSAVCYALK VTNVDPIAND LLFERFLSPA RDGPPDIDID IESDLRENAI QYVYQRYGRE
YAAQVANVIT YRGRSAVRDM ARALGFSQGQ QDAWSKQVSQ WGNLADATHV EDIPGPVVDL
AKQISNLPRH MGIHSGGMVI CDRPIADVCP VEWARMENRS VLQWDKDDCA AIGLVKFDLL
GLGMLSALHY AIDLVAEHKG IEVDLAKLDL SEPAVYEMLA RADSVGVFQV ESRAQMATLP
RLKPRMFYDL VVEVALIRPG PIQGGSVHPY IKRRNGQEAV TYDHPSMESA LRKTLGVPLF
QEQLMQLAVD CAGFTPAEAD QLRRAMGSKR STEKMRRLRG RFFDGMAELH GVTGDVAQRI
YEKLEAFANF GFPESHSLSF ASLVFYSSWF KLHHPAAFCA ALLRAQPMGF YSPQSLVADA
RRHGVVVHGP DVNAGLAHAT LENHGLDVRL GLGGVRHIGD ELAERLVGER KAHGPFTSLT
DLTRRVQLSV PQTEALATAG ALGCFGITRR EGLWAAGAAA TERPDRLPGV GSSSHVPSLP
GMTELELTVA DVWATGVSPD RYPTEFLRED LDAMGVVPAD QLLSLPDGTR VLVAGAVTHR
QRPATAQGVT FMNLEDETGM VNVLCSQGVW ARHRKLAQTA SALVVRGIVQ NATGAVTVVA
DRMGRLSLRA ASKSRDFR
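
The protein sequence corresponds to the gene sequence translated with translation table 11, where an alternative start codon such as GTG is read as methionine. The following is a minimal sketch of that translation, not the pipeline that produced this record. It assumes the standard codon assignments (which table 11 shares) and treats only ATG, GTG and TTG as start codons, a simplification of the full table 11 start set.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 uses the same ones.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumed subset of table 11 starts


def translate(seq: str, first_is_start: bool = True) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and first_is_start and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGGGTTGGC ATACCGGTCC GCCGAGCTGG"))  # MGWHTGPPSW
```

Applied to the final 27 bases of the gene (GCGTCGAAGT CCCGCGACTT CCGCTAG), the same function returns ASKSRDFR, matching the end of the protein sequence.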