Gene Mlab_0620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0620 
Symbol 
ID4795702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp595504 
End bp597144 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content54% 
IMG OID640099280 
Producthypothetical protein 
Protein accessionYP_001030061 
Protein GI124485445 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase 
TIGRFAM ID[TIGR00574] DNA ligase I, ATP-dependent (dnl1) 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTTTG TCGAGTTTGC AGAGATATGT AATACGATCG AGGGAACATC CTCGCGTCTT 
GCGACGGCCG ATATTCTGGC CGAAAAATTT CCTCTACTCA CCGAAGAGGA ACTGCCCGTC
TTCGTCAGAT TCATGCGGGG AAGACTCTTT CCCGACTGGT CGTCCGAGAA GCTCGGGTTC
GGCCCGAATC TTTTGTATGA TGCCCTTGCC TACGTCATCG GTAAAAAGCG GAGTTATGTG
GTCTCGGCGA TCAACAACGC AGGTGATATC GGTAAAGTCG TCGAGAGTCT TATCGAAAAA
CGGGAGCAGA CGATGTTTTT CTCGGAAGAA CTGGATCTTC TGGATGTGAA CGCCCGGTTT
CTGCAGATGG CAAAATCCTC GGGAAGACGA TCTCAGCAGG AGCGGCTGAG ATCCGCCCAG
TATCTTCTTT CGAACGCAAC GCCGCTTGAG GGGAGATACC TTGCCCGTCT CATGCTCGAA
GAGATGCGGA TCGGTGTTGG AGAAGGCGTC GTAAAAGACG CGGTCTCAAA AGCGTTCGGC
GTTCCGTCAG ACATTGTCGA ACACGCTCAT CAGGCTCTCA ACGACCTCGG AGAGGTGGCG
TTCCTCGCAA AAACCGACCC CTCCCGGCTT TCCAATGTTC ATATCACCGC GTTTCGCCCG
GTAAAAATGA TGCTTGCCCA GCAGGGCTCC ATCACCTCGA TGGTCGAAAC GCATGGAACA
CTCGCTGCGG AAAACAAATA CGACGGCAGC AGATTCCAGT TCCATAAATC CGGCGGGAAG
TGTGCGATCT ACTCCAGGCG TCTTGAAGAG ATGACGGCGT CACTTCCCGA TGTTGTTAAG
ATGCTGGATA AGGCGACCGA TCATGACGTC ATCATCGACG GCGAGGTGAT CGCGATCATG
AATGGAAAAC CCATGCCTTT CCAGACAATC CTTCGGCGGA TCCGCAGGAA GCATGATGTC
GGGGATGCGC AGGAGGCAAT CACCCTTCTT CCCTGGGTGT TCGATATCCT TGCTGCCGAC
GGAGAGACCC TGATCGATCT GCCCTTTAGG GAGCGGCGCA AGATCCTTGA ATCCGTCATG
AATGCGTATG TCGCACCGCA GCTCGTCAGC GACTCCGCAG AAGAGATCGA AGCGTATTAC
CACTCCTCGC TCGACAATGG GAATGAAGGG ATCATGCTCA AAGTGCTCGA CTCGCCGTAT
CTTCCGGGAA ATCGTGGCAA GCTCTGGATC AAGATCAAGC CCGAGGTCGA TACGATCGAC
CTGGTGGTGA CGGCGGCCGA GTGGGGCGAA GGAAAACGTG CGAAGATGTT CGGCTCGTTC
CTTCTTGCCT GTCAGGATGA GAACGGCGAC CTTCTGGAGA TCTCGCGGGT GGCGACCGGT
ATCGACGACT CTATGCTTTC GACCCTGTAT GATCTTTTCA AAGACAAGAT CATTGCCGAA
AAGGGAAAGA CCGTGACCTT CGAACCGGAT GTGGTCTTTG AGGTGGGGTA TGCGGAACTG
CAGAAAAGTA CGAATTACGA GGCCGGGTAT GCTCTCCGGT TCCCGCGTTT TGTCAGGCTT
AGGGACGACA AGGATGTCTC GGAGATCGAG ACGCTGGAAA GCCTGACCCG CCGGTATTCT
CTGCAGAATA AGGAAGAGTA A
 
Protein sequence
MQFVEFAEIC NTIEGTSSRL ATADILAEKF PLLTEEELPV FVRFMRGRLF PDWSSEKLGF 
GPNLLYDALA YVIGKKRSYV VSAINNAGDI GKVVESLIEK REQTMFFSEE LDLLDVNARF
LQMAKSSGRR SQQERLRSAQ YLLSNATPLE GRYLARLMLE EMRIGVGEGV VKDAVSKAFG
VPSDIVEHAH QALNDLGEVA FLAKTDPSRL SNVHITAFRP VKMMLAQQGS ITSMVETHGT
LAAENKYDGS RFQFHKSGGK CAIYSRRLEE MTASLPDVVK MLDKATDHDV IIDGEVIAIM
NGKPMPFQTI LRRIRRKHDV GDAQEAITLL PWVFDILAAD GETLIDLPFR ERRKILESVM
NAYVAPQLVS DSAEEIEAYY HSSLDNGNEG IMLKVLDSPY LPGNRGKLWI KIKPEVDTID
LVVTAAEWGE GKRAKMFGSF LLACQDENGD LLEISRVATG IDDSMLSTLY DLFKDKIIAE
KGKTVTFEPD VVFEVGYAEL QKSTNYEAGY ALRFPRFVRL RDDKDVSEIE TLESLTRRYS
LQNKEE