Gene Mlab_1699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1699 
Symbol 
ID4795011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1731940 
End bp1735365 
Gene Length3426 bp 
Protein Length1141 aa 
Translation table11 
GC content57% 
IMG OID640100390 
ProductDNA polymerase II large subunit 
Protein accessionYP_001031127 
Protein GI124486511 
COG category[L] Replication, recombination and repair 
COG ID[COG1933] Archaeal DNA polymerase II, large subunit 
TIGRFAM ID[TIGR00354] DNA polymerase, archaeal type II, large subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.672511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGAAC TTGTAACCTC CCCTGCCTAC CGGGCGTATC TCGACGGTCT TATTGCCGGA 
CTCGACCGGG CGATGGAAAT CGCCAATAAG GCGAAAAGCC TCGGCGTCGA TCCGCGGCCA
TACGTGGAGA TACCGATCGC GAGCGATCTT GCAGGAAGAG TGGAGGCTCT GCTTGGATAC
AAAGGCGTGG CCGCCAAGAT CCGGGAACTG GAGGACCGGA TGTCGAGAGA GGAGGCCGCA
CTCAAGATCG GCGACGCGTT CGTCGGCAAA GAGTTCGGCG AATCGACCCG GGAGGATATC
CTCGATCATG CGATTCGTAC CTCCATGGCT CTTCTCACAG AAGGTGTGGT CGCGGCACCG
ACCGAAGGTA TCGGAAAAGT CGCCGTTCGA AGAAACGACG ACGGAACCGA GTATCTCTCG
ATCTACTACG CAGGACCCAT CCGTTCCGCC GGAGGAACGG CCCAGGCACT TTCGGTCCTC
GTCGGCGATT ACGTGCGCCG GCTTCTCAAC ATCGACCGGT ACAAACCCCG TGAAGAAGAG
ATCGAACGCT ACGTCGAGGA GATCAAGCAA TACAACAACA TCCAGAGTAT GCAGTATCTC
CCAAAAGACG ACGATATACG TCTGATCATC AGAAACTGCC CGATCTGTAT CGACGGAGAA
CCGACCGAGC AGGAAGAGAT CTCCGGGTAC CGGAATCTGG AACGGGTCGA GACGAACGTC
GTGAGGGGCG GTATGGGGCT TGTCATCGCC GAAGGTATCG GTCTGAAAGC CCCCAAGATC
CAGAAAAACG TCGCGAAGAT GAATCTCGAC GGCTGGGAGT GGCTCGAAGA ACTCATCAGC
GGCAACACGC CCGCTCAGGA GGTTGAAGAA GAGCCAGGTG TCCACCCAAA GGACAAGTAC
ATGCGGGATA TGCTCGCCGG CCGGCCGTCG TTTTCGTATC CGATGCGAAA AGGAGGTTTC
AGGTTCCGCC TCGGCAGATG CAGAAATACG GGACTTGCGA CCTGCGGGTT CAATCCGGCT
ACCTTACATA TCCTTGACGA TTATCTCGCG GTCGGCACGC AGATGAAGGT CGAGCGGCCG
GGCAAAGCCT GCGGTGTTGT GCCGTGCACG GACATCGAAG GGCCGACCGT TCGTCTGAAA
AGCGGCGAAC TCCGCCGGAT CGACACGCTC GATGATGCGA ATAAATACTA CGATCAGGTG
GAGTATATCC TCGACATCGG GGAGATCCTG ATCTCGTTCG GCGAATTCAT GGAAAACAAC
CATGTCCTGA TGCCGCCGAG CTACTGCGAG GAGTGGTGGA TCCAGGAAGG AGGACCCCGG
CATCCGAAGA ACGAAGCCGA GGCTCTTTCT TTTGTTCTCG AAGGAGCTTA TCTTCATCCG
GATTACACCT GGTTCTGGGA CGACTGCAGC GAGGGGCAGC TGATTTTCCT CTCGGACAAA
GTCGCCGGGA CCGGCTCTCT TCGCGAAGGG GTTTTGTACA TCCCGGAAGA TCCTGCGGTG
AAGTCCGTCC TCGAAGAGCT GCTCGTGCCG CACACCGTGG AAGAGGGCTT TTACGTTGTC
AGAACGCCTC TTGCATTGAT CATGGGGCTT GGTCTGACCG ATACGCTTGC AAAAAGTCCG
ACCTGGAAGA CCCTGCCGCC GTTTTCAAAC GGTCTTTCCA TGGCGATATC TCTCTCCGGT
CTGAAGATGC GGTCGAAGGC CGGGACCCGG GTCGGCGGAA GGATGGGCCG GCCCGGAAAA
TCCGCTCCCC GCAAGATGAA GCCGCCGGTC CACGTGCTTT TCCCGATCGG CGAGTCGGGC
GGGATGAAGC GTTCGATCGA CAATGCGGCG AAGATCTGCA GTTCGGATTC GGAGGAGAAT
TTCCGCGGGA CGTCGGTGAC GACCGGCCAG GTCGAGGGTC TTATCCAGGT CCAGACCGGA
GAGCGCCGGT GTCCAAAATG CGGGACGGTG ACGTTTAAGA GCCGGTGCGC CGACTGCGGC
ACCCATACGG ACGCGGTCTA TCGGTGTCCC CACTGTAATC AGCTTGGAGA GGAGGGTCAG
GAGTCGTGCC CGAAGTGCGG GGCGAATCTG GTCTGCTCGA AGGAGAGCAT CGTGAGTCTA
GGTCAGGAGT ACGCGGCGGC ACTGAAAAAC GTCGGGATGT CTGCTTCGTC GTCGCCGGAA
CTGAAAGGGG TTCGCGGTCT TATTTCGCGT GAACGGGTGG TGGAGCCTTT GGAGAAGGGT
CTGCTTCGGG CGAAGAACAA CATCTTCGTG TTCCGCGACG GGACGATCCG GTATGATATG
ATCGATCTGC CTCTAACGCA TTTCAGGCCG GCCGAGGTCG GGGTTTCGGT GGAAAAACTC
CGGTCGATCG GCTACACGCA GGATATGCAC GGCGCCGATC TGACGGATGC TTCGCAACTC
GTGGAACTCC ACCCGCAGGA TATCATGGTG TCCGTGGACT GCGGCGAGTA TCTGGTCCGG
GTCGCCGCGT ATGTGGACGA GCTTTTGGAA AAAGTGTACG GGATGGAGGC GTTTTATCAC
GTAAAAACGC CGGAGGATCT GGTGGGCCAC CTGGTGATGG GTCTTGCGCC GCACACGAGT
GCAGGGGTGC TTGCCCGGAT CGTGGGGTTC ACGAAAGCGA AGGCGGGATA CGCCCATCCG
TATTACCATG CGGCAAAGCG GCGAAACTGC GACGGGGATG AGGACTGCGT GATGCTTCTG
ATGGACGGTC TGCTGAACTT TTCGCGGTCG TTTCTGCCGA GCACCCGCGG CGGGACGATG
GATGCCCCGC TCGTTTTGAC GACGACGCTG AATCCAAAGG AGGTGGATAA GGAGACGCTG
AACGTGGATG TGATGCCCCG CTATCCGCTG GAGGTGTATA CGGCGTGTCT GACGTACCGT
GCTCCAAAGG AGGTTGGAAA GTTCGTGGAT TACGTGGAGA AGAGGGTTGA GACGCCCGGC
CAGTTCGAGG GCTTTTCCTT TACGCACGAT ACGACGGATA TCTCGGAGGG GCCGATCGAT
ACGATGTACA CGAATCCGAT CCTGAAAGGA ACGGCGGATA AGATCAAGGC GGAGCTGGGG
CTCGCTGACC GAATCCGTGC GGTGGATACG AACGATCTGG CTGAAAGGAT CATCAACAGT
CATTTGATGC CGGATATGAT CGGTAATCTG CGGTCGTTTT CCAAGCAGGC GTTCCGGTGT
CCGAAGTGTA AGACGAGCTA CCGTCGTATT CCGGTTTCGG GGAAGTGTAA TAAGTGCGGG
GGGCCGGTGA AGGCGACGAT GCATAAGGGG AACGTGACGA AGTATCTGGA GATTTCGAAG
TATATGGCGG AGCATTATAC GCTTTCGGAT TATACGAATC AGCGGATCCA GGTGACGGAG
ATGAATATCA ATTCAACGTT CGGAGAAGAG GAGAAGGTCC AGATGGATCT TTCGGATTTT
TTCTGA
 
Protein sequence
MAELVTSPAY RAYLDGLIAG LDRAMEIANK AKSLGVDPRP YVEIPIASDL AGRVEALLGY 
KGVAAKIREL EDRMSREEAA LKIGDAFVGK EFGESTREDI LDHAIRTSMA LLTEGVVAAP
TEGIGKVAVR RNDDGTEYLS IYYAGPIRSA GGTAQALSVL VGDYVRRLLN IDRYKPREEE
IERYVEEIKQ YNNIQSMQYL PKDDDIRLII RNCPICIDGE PTEQEEISGY RNLERVETNV
VRGGMGLVIA EGIGLKAPKI QKNVAKMNLD GWEWLEELIS GNTPAQEVEE EPGVHPKDKY
MRDMLAGRPS FSYPMRKGGF RFRLGRCRNT GLATCGFNPA TLHILDDYLA VGTQMKVERP
GKACGVVPCT DIEGPTVRLK SGELRRIDTL DDANKYYDQV EYILDIGEIL ISFGEFMENN
HVLMPPSYCE EWWIQEGGPR HPKNEAEALS FVLEGAYLHP DYTWFWDDCS EGQLIFLSDK
VAGTGSLREG VLYIPEDPAV KSVLEELLVP HTVEEGFYVV RTPLALIMGL GLTDTLAKSP
TWKTLPPFSN GLSMAISLSG LKMRSKAGTR VGGRMGRPGK SAPRKMKPPV HVLFPIGESG
GMKRSIDNAA KICSSDSEEN FRGTSVTTGQ VEGLIQVQTG ERRCPKCGTV TFKSRCADCG
THTDAVYRCP HCNQLGEEGQ ESCPKCGANL VCSKESIVSL GQEYAAALKN VGMSASSSPE
LKGVRGLISR ERVVEPLEKG LLRAKNNIFV FRDGTIRYDM IDLPLTHFRP AEVGVSVEKL
RSIGYTQDMH GADLTDASQL VELHPQDIMV SVDCGEYLVR VAAYVDELLE KVYGMEAFYH
VKTPEDLVGH LVMGLAPHTS AGVLARIVGF TKAKAGYAHP YYHAAKRRNC DGDEDCVMLL
MDGLLNFSRS FLPSTRGGTM DAPLVLTTTL NPKEVDKETL NVDVMPRYPL EVYTACLTYR
APKEVGKFVD YVEKRVETPG QFEGFSFTHD TTDISEGPID TMYTNPILKG TADKIKAELG
LADRIRAVDT NDLAERIINS HLMPDMIGNL RSFSKQAFRC PKCKTSYRRI PVSGKCNKCG
GPVKATMHKG NVTKYLEISK YMAEHYTLSD YTNQRIQVTE MNINSTFGEE EKVQMDLSDF
F