Gene Mlab_0195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0195 
Symbol 
ID4795758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp182859 
End bp184034 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content52% 
IMG OID640098841 
Producthypothetical protein 
Protein accessionYP_001029638 
Protein GI124485022 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000146938 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGAA ATAATCTTCG ACACCTACTT CTTACAGCAG TCATCGTCTT CACGCTGATC 
ATTTCCGGCA CAGGGGCGGT ATCGGCTGAA ACCCAGTCTA CTTTCGTAAT CAAGGATTTC
TCCATGTATC CAACGTCTCT GATGTCGGGC GACTCGGGGA TCGTCACGGT CACGATACAA
AATACCGGAA CAACATTCGT ACCAGTCAAC CAGATCTTCA TCAAAGATAT TGACGGCATC
AAATCATCAG TGACCCCGTA TCAGAACCCG ATCGGCGGCG TTGGAGCCGG CGATACCTTC
ACCTTATCTC TGCCGATCAC TTCGACAGGC GAAACCGGCA CATTCTACCC GGTTCTGTAT
GTCGACTTCG GCGGCAACAA CGGAAATTAT CTGAAGTATC CGTTTGCAGT CATCGTTGAC
GATCAAAGCG TCGTCATCTC GATCACCAAC CGCCCCGATG TCTTCGAGCC GGACACCACG
CAAACCGTCG CTTTCACGAT CGGAAACCTG AGAACGAATG CGATCGAAGC GGTCGAAGTC
ACGGCGTCCG GAACCGGCGT CACCACCAAA CAGACCTCGG TCTTCCTTGG AACGATCGAT
GCAAACAAAG CAGCAAGCGG AAACATAAGC GTTACCACAA CTGCAGAGAC GAAAGAGGTC
ACCTTTGATG TGACCTACCG GAACGGAGCC AACTGGCATA CCGAAAGCGT CACGCTTCCT
CTTGAGTCGG GCATTTCCAA AACCGGCGCT GAACTGATCG TCAACAACCT CGAAGTGAAG
AACTCGGGAA CGTACTATAC GATCACCGGC GATGTCAATA ACGCCGGCCT TACAACAGCC
AAGGCGCTCG TCGTCACAAC GGAAGGAGTG ACCAAAACAG GACTCTACCC GTCCTACGTT
GTTGGATCCA TGGATGAAGA CGGTCTTTCC GAGTTCGAAG TGACCTTCAG TAACCCGGCA
AGCGACAACG TAACTCTGGT CTTTACCTAT AAGGATGCGA ACGGAAACGT CTACACGGAA
AAACAGACAG TCCCAATCAC GTCGGCAGTC ACCGAAGCAC AGACGGCAGG TGAATCAAGC
CCGGTCGCCA CCGTCTTGAT CGTTATTGTT ATCCTCGTCA TTCTTGCCGG CGGTTTCGTT
GCCTGGAAGA AAGGTAAAAT ATTTGCACGG AAGTAA
 
Protein sequence
MNRNNLRHLL LTAVIVFTLI ISGTGAVSAE TQSTFVIKDF SMYPTSLMSG DSGIVTVTIQ 
NTGTTFVPVN QIFIKDIDGI KSSVTPYQNP IGGVGAGDTF TLSLPITSTG ETGTFYPVLY
VDFGGNNGNY LKYPFAVIVD DQSVVISITN RPDVFEPDTT QTVAFTIGNL RTNAIEAVEV
TASGTGVTTK QTSVFLGTID ANKAASGNIS VTTTAETKEV TFDVTYRNGA NWHTESVTLP
LESGISKTGA ELIVNNLEVK NSGTYYTITG DVNNAGLTTA KALVVTTEGV TKTGLYPSYV
VGSMDEDGLS EFEVTFSNPA SDNVTLVFTY KDANGNVYTE KQTVPITSAV TEAQTAGESS
PVATVLIVIV ILVILAGGFV AWKKGKIFAR K