Gene Mlab_0435 details

Gene Information

Locus tag: Mlab_0435
Symbol:
ID: 4796178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 419082
End bp: 421385
Gene Length: 2304 bp
Protein Length: 767 aa
Translation table: 11
GC content: 60%
IMG OID: 640099094
Product: hypothetical protein
Protein accession: YP_001029878
Protein GI: 124485262
COG category: [S] Function unknown
COG ID: [COG2433] Uncharacterized conserved protein
TIGRFAM ID:
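
The start, end, gene length, and protein length fields above are mutually consistent, which the following minimal sketch illustrates. It assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(419082, 421385)
print(length)                     # 2304
print(protein_length_aa(length))  # 767
```

Both results match the Gene Length and Protein Length fields of this record.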


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTCG TATTCGGCAT CGACAAGATC ACGGGTTCCG CTCATTCCTC GCCGGGCGGC 
CTCTACTCCC TCGTCCGGGT AGTCGACAAA AGGATCGAGT CCGAAGAACG CAGCATCACC
CTCCAGCATC TCCTCCATCT CATCAATGCC GAGAAGCCGG AGATCGTCGC CTTTGCCTCG
CTCCGGGAGA TCGCCCCCGA CACACGCTCC CTTTTTACCC TGACAGAAGC CCTGCCCTTT
GGCACGAAGC TCGTTCTCAT CGCCTGCATC GGGGCAAAAT CCGCCCCTCT CCCGAAACTC
GCCGAGCGAT ACAACCTCCG CTTCGACGCC TCCAACCCTA TGGAACAGGC CAAGATCTCG
GGTCTCATCG CCTCGTTTGG AGGCGGATAC GAGATCAGGA CCTTCGAAGG GGTCACGACC
ATCACCGTGT CGAGAGCCCG CCCGCCGATC CGCACGCACC AGAACCGGTA CATCCGGCAC
ATGCAGGGGT CTCTGATGTC GATCTCCCGC GAGATCGAGG CCGGCCTCCT TGCAAACGGT
CTCATGTTTT CCTCGTCGAT CTCCCGGTCG TTTCGGGGCG AGCGGATCGT GGAATTCAGC
GTCACCGCTC CCCGTCACGA GGTCCCCGCG TCCTCGAAGA CATCGGCCGG AGTTTCCGTC
CGGGTCGCCG GCACGAGAAA GGAGGCTCCC GATTTCCTCC CTATCTCAAA GCGGCCGCCC
TACTTGATCG TCGGCATCGA CCCGGGCACT ACCGTCGGGA TCGCCGCCCT CAATCTCGAC
GGCGACCTGG TCCATCTCTC CAGCACCCGG GCTCTTGGTC AGGCCGAGAT CATCGCGGCA
ATCACTGCGG TCGGAAGACC TGTTTTGATC GCGACCGACA AGGCCGAGAT GCCGTTCGGG
GTGGAAAAAG TCAGACGGGC CTTCTCCGCC GTGGCCTGGA CTCCGGCGAA GGATGTGCTG
ATCAAAGAGA AGTATGATCT TGCCGAAGGC TACGACTTCG GGAACGATCA CGAACGCGAC
TCTCTCTCGG CCGCGGTAAT GGCTTACCGG AGTTATGCGA ACAAGTTCGA GTCGGTCCAG
AAACGCATGC CCCCGGGAAC CGACATCGAC ATGGTCAGGG CTGGGATCAT CCGCGGCCTC
TCCCTCGAGC AGATCCTCGA AACCCTGAAG CATCCGGAAA TTCTGCCCGA AGAGCCGGAG
GTCATTGTGG AAGAGTTCGT CCCCGACGAA AAGGACGAAC GCATCGCCCG GCTCGAAGAG
ACCGTCGCGA AACTTCGTAA ACTCGCTGGA AGTTTATCCG AAGAAGTCGA GGTGAAGGAC
AAGGCGATCG CAGCCCTGCA GAAACGGCTT GTTTTCGAAC GGGAAGAGAG GAACGCGGAG
ATGCTCTCAA CTGAAGCGAT CACCTCCCGC GACACGGAAC TCGCCCAGGT GAAAAAGGCT
CTCAGGAAAG AAGAGCGCCG CAGTAAGACG CTTCGGACCA GACTCGAGCG GATGAAGCAT
TACATCTCTC TGCAGGCAGG CGACGGCTGC ACGGCGCTGA AAGTCCTCCT GCTGCTTGCG
AAGGATCATG TGAAGACGCT GGACGACGAG ATGGGGGTCG GCGAGGAGGA TATCCTGTAT
GTCCTCACGA TCGACGGCTG GGGTAAGACG GTGATCCGCG ATCTTGCGGA GGCGAAGATC
GGGGCGGTGA TCCTGCCGCG GCTGACGTAT CAGCGGGCAA AAAGCCAGCA TCTTATCGAG
GAGTTTAGAG AAGCGGACAT CCCGATCCTG GACGGAGCGA ACCTTTCACC CCGCGTAAAA
GGAAAGATCG GCGTGGTGGA CTCGGCCGCG TTCGAGGCGG CCCTTGCCGA CTGGAAGACG
ACGCAGGATG TCTACAAGAA CGAGAAGAAG ATCGGTGAGA TTCACGGGAT GGTGAAGGAG
TATCAGGTTG AGCGGGTCAA AGATGTCCTG GAGAAAGGGA TCGACCCGAC GCTGGTGTTC
GAGGTGGACC GGGCAAAACC CGAGACGCCG GCGTCACCTC CGGTTCCTGA AAAACCGGCC
ACGGTTTTCA GACCAAAACC CGCTGCCCCG CCTGTGGAAA AACCTGCTCC GGCAGAAAAA
CCCGTGGTAA AGCCCGCAGC AAAACCCGTA GCGAAGCCGG TTCCAAAGCC GGCTCCGGCC
CCGGTGCCAA AACCGGCGCC GAAAAAGGAA ACCCCGAAAA AACCCGCATC CTCCAAGAAG
CCTGAGAACG GATCCGCGGA GAAGATCCTC TTTGGCGTCC TCTCCGAGTA CCGCGAAGAA
CGCAAAAAGG AGCTGAAAAA ATAA
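
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and the exact method the database used to derive its 60% figure is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Join block-formatted sequence text into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAATCTCG TATTCGGCAT CGACAAGATC ACGGGTTCCG CTCATTCCTC GCCGGGCGGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` gives the value for the whole gene.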
 
Protein sequence
MNLVFGIDKI TGSAHSSPGG LYSLVRVVDK RIESEERSIT LQHLLHLINA EKPEIVAFAS 
LREIAPDTRS LFTLTEALPF GTKLVLIACI GAKSAPLPKL AERYNLRFDA SNPMEQAKIS
GLIASFGGGY EIRTFEGVTT ITVSRARPPI RTHQNRYIRH MQGSLMSISR EIEAGLLANG
LMFSSSISRS FRGERIVEFS VTAPRHEVPA SSKTSAGVSV RVAGTRKEAP DFLPISKRPP
YLIVGIDPGT TVGIAALNLD GDLVHLSSTR ALGQAEIIAA ITAVGRPVLI ATDKAEMPFG
VEKVRRAFSA VAWTPAKDVL IKEKYDLAEG YDFGNDHERD SLSAAVMAYR SYANKFESVQ
KRMPPGTDID MVRAGIIRGL SLEQILETLK HPEILPEEPE VIVEEFVPDE KDERIARLEE
TVAKLRKLAG SLSEEVEVKD KAIAALQKRL VFEREERNAE MLSTEAITSR DTELAQVKKA
LRKEERRSKT LRTRLERMKH YISLQAGDGC TALKVLLLLA KDHVKTLDDE MGVGEEDILY
VLTIDGWGKT VIRDLAEAKI GAVILPRLTY QRAKSQHLIE EFREADIPIL DGANLSPRVK
GKIGVVDSAA FEAALADWKT TQDVYKNEKK IGEIHGMVKE YQVERVKDVL EKGIDPTLVF
EVDRAKPETP ASPPVPEKPA TVFRPKPAAP PVEKPAPAEK PVVKPAAKPV AKPVPKPAPA
PVPKPAPKKE TPKKPASSKK PENGSAEKIL FGVLSEYREE RKKELKK