Gene Mlab_0367 details


Gene Information

Locus tag: Mlab_0367
Symbol:
ID: 4794864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 344862
End bp: 346076
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 52%
IMG OID: 640099018
Product: hypothetical protein
Protein accession: YP_001029810
Protein GI: 124485194
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
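
The length fields above can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(344862, 346076)
print(length_bp)                  # 1215
print(protein_length(length_bp))  # 404
```

Under these assumptions both values agree with the Gene Length and Protein Length fields in the record.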


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACAAA ATAGAGCAGT AATGGTCAGG ATCGGGGAGC TTTGGCTCAA AAGCGAGCCG 
GTCAAAAAAC AGTTCATGCT TGCGCTAACG CGAAATATCA AAGCAGCTCT GGATACGCAG
GAAATCCCAT ACATGTTAGA GGAATACCGG GGAAGACTGC TGATCTTCGG CGATGCCGCA
AGAATCGCCC CGGTCGTAGC GAGGATATTT GGAATCGTTG ACGTCAGTAT CTGCGAAACG
ACCACAAACC GTCCCGAAGA CATGGCAAAA ACCGCCCTGA CGTTTTCTGA GAAAAAACTC
AAATCCGGTA TGCGTTTTGC GGTCAGAGCG CGACGCCAGC ATGTGAGCGG ATTCACCAGT
CAGCAGCTTG CCGGCATGAT CGCCGATGCC ATTTGGGAGA AGATCCCGGA TTTCGTTGTC
GATCTCGATG ACCCGGAGTA TGAAATATTC GTCGAGGCAC GGGAATACGG TGGAATCGTA
TATGATGAGA GAATTCCAGG ACAAGGCGGA CTTCCCCTTG GAACAGCAGG CCGGGCAGTA
GCGCTTCTTT CCGCAGGCAT CGATTCGCCC GTTGCCGCAT GGCTGATGAT GAGGAGAGGC
GTAGTCATCT CCGGTGTATT TATGGATGGG GGAAGATGGG CGGGATCTGC CACGCGTGAC
TTGGCAATGG ATAATGTCAG AATTCTCTCC ACCTGGTGTC CCGGAAGGGG TCTGCCGCTT
TGGATCGTGA ATCTCGAGCC GTTCTTCGAT GCGATGATGA CTGCATGTGA CCGACATTAC
ACCTGCCTAT TCTGTAAGAG ATTCATGATG CGGGTGGCTG AAGAGGTGGC GAAAGAGAAT
AAAATGGAGG GGATCGTCTC CGGAGAAAAT CTCGGGCAGG TGGCATCACA AACTTTGCAG
AATATGGGCG TCATCACCGA ATCGGTGAAG CTTCCAGTTT TGCGCCCTCT CCTGACCTAT
GATAAAGAGG AGATCGTAGC GATCTCACGA AGAATCGGCA CCTATCACGA AAGCCCAGGC
GACACGGGGT GCCTCGCGGT CCCCAAAAAA CCGGCAACCC GCTCCGCACA GGATCTGATC
GACACTGAAG AAAACAAGCT CGAGATGAAT GAGCTCGTTC GGCAGGCCGT TGAATCAGCA
GAGCTTTGGA TCGCCAAAGA CGGAGAGATC TTCCAAAAAA TACTCACTGA GAGAACGATC
GATTCTTCCG TTTAA
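
A GC content figure such as the one in the Gene Information section can be computed from a nucleotide sequence as the fraction of G and C bases. The sketch below is a minimal illustration. Whether the source database computes its value over exactly this gene sequence is an assumption, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a sequence; whitespace is ignored."""
    s = "".join(seq.split()).upper()  # drop the 10-base block spacing
    if not s:
        raise ValueError("empty sequence")
    return sum(1 for base in s if base in "GC") / len(s)


# First 10-base block of the gene sequence above: 3 of 10 bases are G or C.
print(gc_content("ATGGAACAAA"))  # 0.3
```

Passing the full gene sequence, with or without the spacing, returns a fraction that can be compared against the reported percentage.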
 
Protein sequence
MEQNRAVMVR IGELWLKSEP VKKQFMLALT RNIKAALDTQ EIPYMLEEYR GRLLIFGDAA 
RIAPVVARIF GIVDVSICET TTNRPEDMAK TALTFSEKKL KSGMRFAVRA RRQHVSGFTS
QQLAGMIADA IWEKIPDFVV DLDDPEYEIF VEAREYGGIV YDERIPGQGG LPLGTAGRAV
ALLSAGIDSP VAAWLMMRRG VVISGVFMDG GRWAGSATRD LAMDNVRILS TWCPGRGLPL
WIVNLEPFFD AMMTACDRHY TCLFCKRFMM RVAEEVAKEN KMEGIVSGEN LGQVASQTLQ
NMGVITESVK LPVLRPLLTY DKEEIVAISR RIGTYHESPG DTGCLAVPKK PATRSAQDLI
DTEENKLEMN ELVRQAVESA ELWIAKDGEI FQKILTERTI DSSV
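
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a hand-rolled illustration, not the database's own method. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in which codons may act as alternative starts, and that does not matter here because the gene begins with ATG). The `translate` function and the `CODON_TABLE` name are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (standard assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = "".join(dna.split()).upper()  # remove block spacing and line breaks
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGAACAAA ATAGAGCAGT AATGGTCAGG"))  # MEQNRAVMVR
print(translate("GATTCTTCCG TTTAA"))                  # DSSV
```

The two outputs match the first ten and last four residues of the protein sequence listed above. Translating the complete gene sequence the same way should yield the full 404-residue protein.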