Gene Mlab_1250 details


Gene Information

Locus tag: Mlab_1250
Symbol:
ID: 4794784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 1275227
End bp: 1276387
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 50%
IMG OID: 640099929
Product: hypothetical protein
Protein accession: YP_001030686
Protein GI: 124486070
COG category:
COG ID:
TIGRFAM ID: [TIGR02537] archaeal flagellin N-terminal-like domain
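
The gene and protein lengths above follow from the start and end coordinates. A minimal sketch of that arithmetic is given here, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1275227, 1276387)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Both values agree with the Gene Length and Protein Length fields of this record.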


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.138271
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAAGCAA GACCGGAAGC CGCAATATCG CCGACGATCG GTGTGATACT TCTCATTTTT 
CTAACCGTGG TTCTCGTCGG CGCAGCTGCC CTCGTTTTTT TCGGATTTTC CGGCGATTTG
ACCGAGCCGA AACATGCATA TATCACCGCG GAAACAACGG ACAATGCCAC AAATCCCCTG
GTCCTGCGTG TTTGGGACAT TGGTGACGGA ACGGTGCTGA AAGATCTGGT CGTTTCGGTA
AACACGCCGG AGGGAGTCAG CATCGGAACA CCAACCAAGA TTTCGAATGC ATTTTATGTG
GGAGAGACCA TCCCAATCGA ATTCGATGAC AGCTTTCCAA AAGGGAGCTA CCTTGTCACG
GTAACCGGCG AATTTGCCGA CGGAGCAAAC CAGGTACTTT TTACCAAAAT CATGGACCTT
GGAGCCGACG GCAGGAAGAT TCAGGAAGTC AGTACGGAAA AGCTGTTCCT TATGGCGGAT
TATGCGTACT GGAAACCAAA CAAGCTTTAT CTGACGGACA CGACCCCCTA CAAAAATATC
TTGAACATCT CGTACTGGAC GCTTGATCTC GGCTATTCGG GCGTGGACAT ACAAGTATTG
TCCCACCCGG CAGATGATTA CAGCTATACT TACCCGACCG AGGCATTCAA CGGCACATCA
CAGACATTCG TGATCACCTA TACTGCATAT TATATCGACG GAAAATCCAA GACGACCGTG
CAGAAAGAAG TACAGCTCTA CGTCAGCGAA CAGCCGGACC AACTGGGAGA ATATGTGAAA
AACTACAAGA TCGGTGGAGA ATCCCTGCAG GGAACGGGAG ATTCAACGCA TCACATACCC
CTTGTCGGGA TACTGGCAAC CACGATCTGG AGAGAGGATA TCGACGGGCG ATATATCATA
GTAGACGGAG AATACCGCCC CGCCCTCCAG GTCAATCTGA ACGATCCAAA AGAAGGAGCC
AGTTCCGTTT CGGTCACCAG CGACACGGAA TTCCGTGTGG GAAGCACATA CTTTGCACCC
GGCGAAACCT ATGCGGGAAC CTTTTCGACC TTCTATCTCA ATACGGCAAA CAGGACCGCG
ATCAATGTCA CCTTAAAAAT ATTCGACGCA AGTAATACCA AGCTGGCCGA ACAGACAACA
CTGATCACCA TCAGGGACTA A
 
Protein sequence
MKARPEAAIS PTIGVILLIF LTVVLVGAAA LVFFGFSGDL TEPKHAYITA ETTDNATNPL 
VLRVWDIGDG TVLKDLVVSV NTPEGVSIGT PTKISNAFYV GETIPIEFDD SFPKGSYLVT
VTGEFADGAN QVLFTKIMDL GADGRKIQEV STEKLFLMAD YAYWKPNKLY LTDTTPYKNI
LNISYWTLDL GYSGVDIQVL SHPADDYSYT YPTEAFNGTS QTFVITYTAY YIDGKSKTTV
QKEVQLYVSE QPDQLGEYVK NYKIGGESLQ GTGDSTHHIP LVGILATTIW REDIDGRYII
VDGEYRPALQ VNLNDPKEGA SSVSVTSDTE FRVGSTYFAP GETYAGTFST FYLNTANRTA
INVTLKIFDA SNTKLAEQTT LITIRD
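
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip the spacing, compute GC content, and translate the gene sequence for comparison with the protein sequence. It is an illustration under stated assumptions rather than the method the database used: the function names are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for base in dna if base in "GC") / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
dna = clean_sequence("ATGAAAGCAA GACCGGAAGC CGCAATATCG")
print(translate(dna))  # MKARPEAAIS
```

Passing the full gene sequence through clean_sequence and translate should reproduce the protein sequence listed above, and gc_percent on the cleaned gene sequence can be compared with the GC content field.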