Gene Mlab_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1054 
Symbol 
ID4794506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1065480 
End bp1066784 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content53% 
IMG OID640099725 
Producthypothetical protein 
Protein accessionYP_001030490 
Protein GI124485874 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.587451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACA CAGACATCTT TGGAAAACAG GTCCCTGTTC TCATCAGAAA CGTCAGCCTG 
AATGGAAAAA TACAGGAGAT CTACCTCGAC GGTACAGGAA TGATCGGAGC GGTCGGAGAA
AAAATCACCG ATCACGACGC CGAGTTCATA ATCGATGGTG ACGGGGCAAC AGCACTTCCC
GGCATGATCA ATATGCACAC CCACTCGCCA ATGGGTCTGC TTCGCGGCTA TTCGGATGAC
ATGCAGCTCT TCGAGTGGCT TTCCACCAAG ATCTGGCCGA CAGAGGCACA TCTCACCGAG
GATGATATCT ACTGGGGCGC GAAACTCGCC TGCCTGGAAA TGATCCGTAC CGGAACCACG
ACCTTCAACG ACATGTATTT TAAAATGGAG CAGATCGCCC GTGCCGTTGA TGAATCCGGC
ATCAGAGCAT GTCTTTCTTA CTGCATGATC GACGGCGGGG ACCATGCGAA GTTCGAGTCG
GAAGCCCGTG TCATGGAGTC GACGGTCAAA AATATCAAAA ATATGAACAA TCCACGGGTC
ATGCCCGGCG TTTCCCCCCA TGCGGTCTAT ACCGTTTCAA AAGAGGGTTT GACCTGGTGT
TCAGAGTTTG CCAAAAAGGA GAATATTCCC CTGCATGTAC ATCTTTCCGA GACCGAACAG
GAGGTCACTG ACTGTGTTGC AGCCCACGGC ATGAGACCTC CGGCATGGCT GGATCACTGT
GGTGTCTTGT CCGAGCAGTG CATAGCGGCA CACTGCTGCT GGCTTGATGC TGACGACATT
TCTCTTCTGG CAAAGAGAGG AGTGACCGCC GTCCATAACC CGATCAGCAA TATGAAACTC
GCCGGAAACC GTGCTCTTCC CTATCCGGAG ATGAAAGCAG CCGGCGTGAA TGTGGCTCTT
GGAACGGATG GAGCTTCCTC GAATAATGAT CTGGATATGT TCAGTGAGAT GAAAACTGCG
GCAATCCTGC AGAAGTTCTT CTGGAACGAT CCGACCGTGA TGCCGGCAGC CGATGCGCTG
AAAATCGCCT CGCCTAACGG GGCAAATGCA TTGGGTCTCA ATGCAGGAGT GATCGCTCCG
GGCCATCTTG CAGATCTGGT CCTTGTTGGA CGAAATCCGC TGAATGTACC TGCATTCAAC
ACAGATTCAA ATGCCGTGTA TGCAACGAGC GGTCTTGCCG TCTCGACGAC GATCTGCGAT
GGTGTGATTC TGATGCATGA CGGTATCATC CCTGGTGCGG AAGAGATCAT GGAAAAGGCG
GGATCGGTCG CCTTTGACCT TGTTAGGCGG GCGACCGCAC CCTAA
 
Protein sequence
MTDTDIFGKQ VPVLIRNVSL NGKIQEIYLD GTGMIGAVGE KITDHDAEFI IDGDGATALP 
GMINMHTHSP MGLLRGYSDD MQLFEWLSTK IWPTEAHLTE DDIYWGAKLA CLEMIRTGTT
TFNDMYFKME QIARAVDESG IRACLSYCMI DGGDHAKFES EARVMESTVK NIKNMNNPRV
MPGVSPHAVY TVSKEGLTWC SEFAKKENIP LHVHLSETEQ EVTDCVAAHG MRPPAWLDHC
GVLSEQCIAA HCCWLDADDI SLLAKRGVTA VHNPISNMKL AGNRALPYPE MKAAGVNVAL
GTDGASSNND LDMFSEMKTA AILQKFFWND PTVMPAADAL KIASPNGANA LGLNAGVIAP
GHLADLVLVG RNPLNVPAFN TDSNAVYATS GLAVSTTICD GVILMHDGII PGAEEIMEKA
GSVAFDLVRR ATAP