Gene Mlab_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1684 
Symbol 
ID4795878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1719310 
End bp1721130 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content59% 
IMG OID640100374 
Producthypothetical protein 
Protein accessionYP_001031112 
Protein GI124486496 
COG category[R] General function prediction only 
COG ID[COG3378] Predicted ATPase 
TIGRFAM ID[TIGR01613] phage/plasmid primase, P4 family, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.521904 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00120785 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCTTGA ACCAGCCAAA CCCTCCGCTT CCGGCGGGCT ACACCGCCGA ACTCACTGAA 
ACGATCACTG AACCCCTTTC GCCGAAAGAA GCCAAAATCG GTCTCGACGT CCAATCCTAC
CGGTTCAAGC TCTATAAATT CGGCCAGCTC GTCGACATCC TTACGACCTT TGACACCGGC
CCCGACCTTC GGCGCGCTGT CAGGGAACAC ATCATCGACA CCGAACCGGA AGGCCCCGAA
AAAGAGTTCC TTCTCCGGTT CTTCGACCCG GCATCCACCC CGCCCTGGGA GCGAAAAACC
ATGATCCGCG GGCAGCTCCC CACGCCCGAC GCTCTCCTCA ACTCCCTCAA CGACGAAGGA
AACGCCGTCC GTTTCGAAAA AGAAGCCGGC GGCAACCTCG TCTACGACAT CGCGAGCGGC
CAGTGGTTTG CCTTTATCAC CAACCACTGG GAACCCGCCA GAGAAAAACT CGGGAAGGTC
CTTCGGCTGG TCGGGAAAAG CCTCGAACAG GAACTCGACT ACTGGAAGCG CCGGGCCGCC
GCCGAAAACA CCCCCGAGAT GCGAAACCTC GTTGTCCAGC TCCAGAATCA TGTGAATCTG
AGCAAAAACC ACACCAAACA GGTCGCCCTC AGAAAAATGA TCGAAGGCTC GTCCATGCAG
GTGAATCTTT CCGAGGCGTC CGACGGCCGG TATATCACCT GCAAAAACGG CGCTCTCGAC
TGCAGGACGG GCGAGTTCAT CCCGATCTGG GCGTGCGATT CGATCCGGGA GAAGTACCCG
CTGATCTATC TGGACGCCGT CTACACGCCG GGGCTTCGCT CGCCGGCGTT CATCGACCAC
CTCAAAAAGG TGTTCGATGA CAATGTGAGC GGTCTTTCTG AGGAGGAGCG GACGCTCCAG
ATGATGGAAC TCGGCAGGTG TTTTCTGCGT CTGCTGGGTT ATCTGCTGTT TCCGGGCAAT
CCGGAGCAGG TGATTATTTT TCTGTGGGGG AAGGGGAGTA ACGGGAAGTC GACGACGATC
GACGTGCTGC GGGAGATTTT CGGGTCGGAG ATGTCGGAGG CTTCGGTCCG TGAGCTGTAT
GCGGGGTCCG AGGATCGTCC GGCGTCGGGG GTTGCCCGGT CGCTTTCAAA GCGGGTGATG
CTGATTTCGG AGGCGAGCGA TGAGGAGTCG CGGGGCGGCC GGATCAGCGC GGATACGGTG
AAAGCTTTGA CGGGGGACGC GGTGACAAGC CGGTTTCGGG ATATGTATGA GAAGTCCCGC
CCGCAGCGGG TGGTGTGCAC GCCGGTGGGG GTGACGAATG AGCTGCCCCG GTTCGATAAG
ACGCTGGATT ACGCTCTTCT GCGCCGGATT TTTACGATCC CGTTCCCGCA TTTGTTTGCG
GGGGATGAGC GGGCGCGTGA TATCCGGGAG TGTTTGCTTG CGGAGCGGGA TGCGGTCTTT
TCGATGGTGG CGGATGAGCT GATCGCGTAT ACGAAGGAGG GGCTTTTGCC GCAGCCGGCG
TTTTGCGCAT CCACGCAGAA TGAGCTGCTG GCGGGGTTTG AGGTGTCGGC GTTTATCGAG
GAGTGTGTGG AAAAATCTGA GACGGGGCGT GTGTCCCGTC TGGAGCTGGA GGAGGCGTAT
ATTTCGTGGT GTGCCCGCCA TGATATTCCG GTGGGGCTTG CTAAGATCCA GATGCCGGGG
TATGATGAGT ATTCGCAGGT GAATTTCCGG CAGGGTCTGT CGGAGAAGGA GAAGAGGGGG
CTGTTTAAGG GGATGCGGGT GTATGGGTTT GAGGAACAGC GGACGAACAG TCAGCGGTAT
TTCAAGTGCC GGCTGAAATA G
 
Protein sequence
MALNQPNPPL PAGYTAELTE TITEPLSPKE AKIGLDVQSY RFKLYKFGQL VDILTTFDTG 
PDLRRAVREH IIDTEPEGPE KEFLLRFFDP ASTPPWERKT MIRGQLPTPD ALLNSLNDEG
NAVRFEKEAG GNLVYDIASG QWFAFITNHW EPAREKLGKV LRLVGKSLEQ ELDYWKRRAA
AENTPEMRNL VVQLQNHVNL SKNHTKQVAL RKMIEGSSMQ VNLSEASDGR YITCKNGALD
CRTGEFIPIW ACDSIREKYP LIYLDAVYTP GLRSPAFIDH LKKVFDDNVS GLSEEERTLQ
MMELGRCFLR LLGYLLFPGN PEQVIIFLWG KGSNGKSTTI DVLREIFGSE MSEASVRELY
AGSEDRPASG VARSLSKRVM LISEASDEES RGGRISADTV KALTGDAVTS RFRDMYEKSR
PQRVVCTPVG VTNELPRFDK TLDYALLRRI FTIPFPHLFA GDERARDIRE CLLAERDAVF
SMVADELIAY TKEGLLPQPA FCASTQNELL AGFEVSAFIE ECVEKSETGR VSRLELEEAY
ISWCARHDIP VGLAKIQMPG YDEYSQVNFR QGLSEKEKRG LFKGMRVYGF EEQRTNSQRY
FKCRLK