Gene Mlab_1203 details


Gene Information

Locus tag: Mlab_1203
Symbol:
ID: 4795143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 1231173
End bp: 1234370
Gene Length: 3198 bp
Protein Length: 1065 aa
Translation table: 11
GC content: 42%
IMG OID: 640099877
Product: hypothetical protein
Protein accession: YP_001030639
Protein GI: 124486023
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
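
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts a single terminal stop codon that is not part of the protein; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1231173
END_BP = 1234370


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3198, matching "Gene Length"
print(protein_length(length_bp))  # 1065, matching "Protein Length"
```

Under these assumptions the 3198 bp span corresponds to 1066 codons, of which the last is the stop codon, leaving the 1065 residues reported.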


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.666944
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0115575
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGTC CTATTTCGTC TCAAGAGCAG ATAATTCAGA TAGTCAATAC AGCTCTTCAG 
AACAGAAAGG ACAGTACAGT CAATATCTTA AATGATAAGC TTACTCTGTC AGTATTTTCC
GAACTTGAAA GAAATCTGCA GAATGTTTCG GAAATCAACT TTATTGTCCG TGAAGAACAT
AGTGTTCCTG ATAAAAAAGA GTTGATTCAT GAGTTTGAAC TAACAACAAA ACCATCTGAT
ATTCTCTTTA ATAGCTATGA AATAGCGGAA AAAAACAAAC TCCGGTATTT CCACAAGGCA
CGGACGATGC ATGATTTCAT CAAGTCCAAC GTGAATGTCA GAAAAACCCT TAATCCAAAT
ATGGTGAGAG GAAATGTTCT TTTAATCGAT GATGATGTTC AGATTCAGGG AACATCGTCA
CTGGAAATAT CGAAGCCAAA AACTATTCAC GGTCTTCCTC AGATCAATTT TGATACATTT
ATCAATAGTT CCATGGATAA GGAACAGATC ACTCGGTCTC TTAAGTTATT CCATACACTT
TGGAATACCA GTGGATATAC CCAGGATTTC AAAGAAGAGC TTCTTGAGAG TCTGTTATAT
ATCTACAAAG AGTATTCTCC AGAGTTTTTG TATTATTATA CGCTGTACGC CCTGTTTGGG
GATCAATTGG ATGCGAGTGT GGAGCATTTT GAGAATGACA ATACTCGATT TAAGAAGACA
AAGATCTGGA ATTCGTTGTA TAAGTTTCAG CAAGACGCCG TTGTTACTGC AATTCAGAGG
ATAAACCAAT ATAACGGCTG TATTATTGCA GATAGTGTTG GTCTTGGAAA GACATACGAA
GCATTGGCCG TTATCAAATA TTTCGAGATG AGGAACGACA ACGTTCTGGT ATTAGCTCCG
GCAAAACTCT ATGACAATTG GGATTCTTTC AAGAGCCCAT ATGTTGACAA TCCACTGTCA
GATGATAAAC TCAACTACAA AGTGCTCTCC CACACTGATT TGTCACGAGA TACAGGCTAT
TCACGAAGCG GGCTTGATCT TTCGCGTATT GACTGGGGGA GTTTTGACCT TCTGGTAATA
GATGAATCGC ATAATTTCAG GAATAGAACA GATAGTAAAG ACCACGTTAC ACGGTATCAG
AAGCTTCTCA CGGACATTAT CAAAAAGGGC GGGAACACAA AAGTCCTTCT GTTATCAGCA
ACTCCGGTGA ATAATTCTCT GGTAGATTTA CGCAATCAGA TCAGTATCAT TACTTCAGAT
CGTGATTTTG CGTATGAAAA TGGGGGTATT CCAAGTATTT CCCAGGTTCT TATCAAAGCA
CAGCGTGAGA TAAATGACTG GTCCAGAAGT TCCAAGAGAA ATAAAACAGT CCTCCTGGAT
AGCCTACCGT CCGAGTTTTA TAAGCTTCTT GAGATGGTGA CAATCTCGCG AAGCCGCAAG
CATATTACGA GTTGTTATGG GTCGGAAAAT CTGGCGATGT TTCCAAAGAA ATGCATTCCA
TTGACCTTCA GACCGGGCAT CGATTCCGAC GGCATGGTAA TGAAATTCGA CGAGGTGAAT
GAGGAGCTTG AAACTCTGCT TCTTTCGGTA TATTCACCAA TGGCTTATAT CCTCCCAGAG
TATCAGGCAG AATATCGTGA GAAGTATGCG ACAAAGATTA ATGATCGCGA GGTGTTTTTT
CACGAACAGC GAGAGAGTAT CAATATCAAA CTTCACCGGT TTAATCTGTG TAAGCGGCTA
GAAAGTTCAG TGTATTCGTT TGGGAAAACG CTGGAACGGA TTCTTGGACG TATTGACGGG
TATCTTCTTT CTTTGGAGAG AGGAGAGAAG CTCCTTACTG CAGATACTAA TGGCGAGGAT
CTGGATGAGG ATGAACTTGC AGAGATGGAA GATGCATCTT ATCTGGAGTA CAAATACGAG
ATAGATGTCC GGCATTTGGA CGTGACGAAT TACATTGAGG ATTTGCAGGC AGATAAGCGA
AAGATCAAGA AGATTCTTGG TCAGGTGAAT ACAGTCCTTG AAGGGAAACG TGATGAGAAA
CTGGCGACAG TGCAGGCGTT CGTTCTCGAT AAAATACAGA AAACCCCGTA TAATGTAGGG
AACAAGAAGG TTTTGATCTT CTCGGCATTT GCAGATACGG CAAATTATCT GTATGATAAT
TTGTCTCCGC TTTTACGAGC CAAGGGTATT CATACAGGAA TTGTTACCGG AGGAAATAAG
CCCCGGACAA CAATAAAGAA GGTCAATCTG AAATATAACG AGATTCTATC GCATTTCTCT
CCAATTTCAA AAGAGCGACC GGATGCGCGG AATTTTGGAG AGATCGAGGT TCTTATCGGA
ACGGATTGCA TTTCGGAGGG TCAAAATCTT CAGGATTGTG ATTGTGTCGT AAATTATGAT
ATTCAGTGGA ATCCTGTTGT TTTGATTCAG AGGTTTGGAA GGATCGATCG CCTGGGCAGT
TTGAATAAAC GGATCCAGAT GGTGAATTTC TTCCCAGATA TGGATCTGAA TGAGTATCTG
CAGCTTGAGG AGAGAGTGAA GCGTAAGATG GTTGCTGCAA ATCTGGGTTC GACGGGCGAT
GAGGATTTGT TGAGTCCGGA GATGAATGAT CTAGAGTTCA GGCGTGTTCA ACTGGAGAGG
CTGCAAAAAG AAGTTGTGGA TCTTGAGGAG ATGAGCGATT CGATTTCCCT GACTGATCTG
AATATGAACG GATATCTGAA TGAGTTGTAT GAGTTTGTGT CGGCACATCC CGAGGTGAAA
AAAGTTCCCT CCGGACTTTT TTCGATTACG AAGGGGGAGG AGAAGGGTTG TCTGTTCTGT
TTCCGTCATA TGGATAATCT GGCGAAACCG AAGAGTGACA GTTCGCTGTA TCCGTATTAT
CTTCTGTATA TGAAAAATTC TGGTGAGGTT TATATCGGGA TGCATAATGC CCGTGAGGCG
TTGTCGGAGT TCAGGCGTTT GTCCTATGGT AAAGAAGTGC CAGAAATGCG GCTTTTCCAG
CTGTTCAATG CAAGGACGAA GTATGCTTCG GATATGACGA AATATTCGAA GCTTATCACA
AAGGCGATTT CGGCAATTAC GGGAGCAGAG CGAAAACGTG CCGAGGAGAG TATTTTTGAT
TTCACAGGGT TCGTAGACGA GTTTGCTCAT ACTGCGGAGG ATGATTTCGA GTTGATTTCC
TTTTTAATTG TGGAGTGA
 
Protein sequence
MASPISSQEQ IIQIVNTALQ NRKDSTVNIL NDKLTLSVFS ELERNLQNVS EINFIVREEH 
SVPDKKELIH EFELTTKPSD ILFNSYEIAE KNKLRYFHKA RTMHDFIKSN VNVRKTLNPN
MVRGNVLLID DDVQIQGTSS LEISKPKTIH GLPQINFDTF INSSMDKEQI TRSLKLFHTL
WNTSGYTQDF KEELLESLLY IYKEYSPEFL YYYTLYALFG DQLDASVEHF ENDNTRFKKT
KIWNSLYKFQ QDAVVTAIQR INQYNGCIIA DSVGLGKTYE ALAVIKYFEM RNDNVLVLAP
AKLYDNWDSF KSPYVDNPLS DDKLNYKVLS HTDLSRDTGY SRSGLDLSRI DWGSFDLLVI
DESHNFRNRT DSKDHVTRYQ KLLTDIIKKG GNTKVLLLSA TPVNNSLVDL RNQISIITSD
RDFAYENGGI PSISQVLIKA QREINDWSRS SKRNKTVLLD SLPSEFYKLL EMVTISRSRK
HITSCYGSEN LAMFPKKCIP LTFRPGIDSD GMVMKFDEVN EELETLLLSV YSPMAYILPE
YQAEYREKYA TKINDREVFF HEQRESINIK LHRFNLCKRL ESSVYSFGKT LERILGRIDG
YLLSLERGEK LLTADTNGED LDEDELAEME DASYLEYKYE IDVRHLDVTN YIEDLQADKR
KIKKILGQVN TVLEGKRDEK LATVQAFVLD KIQKTPYNVG NKKVLIFSAF ADTANYLYDN
LSPLLRAKGI HTGIVTGGNK PRTTIKKVNL KYNEILSHFS PISKERPDAR NFGEIEVLIG
TDCISEGQNL QDCDCVVNYD IQWNPVVLIQ RFGRIDRLGS LNKRIQMVNF FPDMDLNEYL
QLEERVKRKM VAANLGSTGD EDLLSPEMND LEFRRVQLER LQKEVVDLEE MSDSISLTDL
NMNGYLNELY EFVSAHPEVK KVPSGLFSIT KGEEKGCLFC FRHMDNLAKP KSDSSLYPYY
LLYMKNSGEV YIGMHNAREA LSEFRRLSYG KEVPEMRLFQ LFNARTKYAS DMTKYSKLIT
KAISAITGAE RKRAEESIFD FTGFVDEFAH TAEDDFELIS FLIVE
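
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might join the blocks, compute GC content, and translate the gene sequence. It is not the method used by the database. The helper names (`join_blocks`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares with table 1 (table 11 differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG).

```python
# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = join_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored; any
    character other than A, C, G or T raises a KeyError.
    """
    seq = join_blocks(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGGCAAGTC CTATTTCGTC"))  # MASPIS
# Last line of the gene sequence above, which ends in the stop codon TGA.
print(translate("TTTTTAATTG TGGAGTGA"))    # FLIVE
```

The two fragments translate to the first six and last five residues of the protein sequence listed above. Running `gc_content` on the full gene sequence would give a value to compare against the reported GC content.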