Gene Msed_2189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2189 
Symbol 
ID5105410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2100279 
End bp2101697 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content40% 
IMG OID640508083 
Producthypothetical protein 
Protein accessionYP_001192252 
Protein GI146304936 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000970823 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000755051 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGAAAG CTCTAACTAT CCTATTAGTT ATATTGATAC TTCCGAGCCT ACTTTCAGGA 
CTAAGCAATA TCGTGGCATT TGCTCAAAGT GGAAGCGGTA GTAATAATTT AGGGACCTTT
ATCTCGACAC TACAAACATT GATACCTGCA GCACTGCTAT TGCTTGCAAT TTTGGCAAAC
AGATACGATC AACGTTATGC TTTAGTGTTG TTCGCAGCTG CTTTAGCTTC AGCGTTATTT
CTAGCAGCTG CTACGGGAGG AAAACTAGGA GGAAGCGTAA ACATAACATT AGTAGAACTG
AAAGTAACAG TTTCTGGTCC CACTTCGGCT TATACTGGAA ACGCTGAAAC TTATACAGTG
TCCTGGAGCC CATCTATGTC TGGAACAGTG ATATGGACAG TATTATATAA TGGGAGCATA
GTATACAATT CCACAGGCGG AACTTCGTTC ACCTATACAT TCAATGAACC TGGAAAATAC
ATAATTGCTG CGGCTGTAAT TAATGATCAA AACTATGCAG GAGGGTCTGG AGCAGTCTTA
ATAACTGTAA CGAACCCACC TAGCCCCTTG GGCTGGCTTA CTGGGGCTAT AACGAGCGCA
ATCTCTGGAT TTATTAACAC TGCAGCTAAC ACATTTGAAG GATTTTTGAC TACACTTTTA
CAGATTTTCG GAGCACCTCT AGAATGGATG ACTTATTCCC CTACGCCCAA TATTTCTCCA
ATTACTCAAA CGATATATAA TGAAATCCAA GATTATGCTA TCGGGCTAGC TATGTTGTTC
GTATCTTTCT CGATAGCATA CAACGCATTC AGAGGGTTCT ATTCGGACCT CGTTGATCTT
GCTGGTGATG TCCTTTACAA GCTCGGCGTA TGGGGACTAT TCTATGCAGG TGGGATGATA
GTATACACTT ATGCTGCGAA TTTCATTAAT TCTATCATTT ACTCAGTCGC AGGTCCATAT
CTAGGAATAG CAACGATAGA ATATACAGGA GGAGCAACTC TCTACACGGC CCTATTTGCA
TTAATGAACG GGGTTCCCTT TGGGTTTGGG GATTCTTTAG ATATGTTTTT ATCTCTAGTT
ATGTTTCTCT TAGCTTTCAC ATTAGCGGTA GCAACAATAA AATATGCTGT AATGTTATCT
GTTGTATCTA CTATTCCACT ATGGGCTTCA CTCTGGATAT TTGAATGGAC TAGAAAAATT
GCGATGATGG TAGTAGATTT GCTAATAGGC TTAATGGTAG CGGGTTTGGT AGCCGCAATA
ACTTTTGCAA TCTTAGCAAC ATTACCTATC GGAGCCTTGC TGTTTATCAT AGACCCTATA
GCGATTGATG GAGAATTCCT GTTCTCGTTA GTGCTTTTTG TGTTTAGTTT GAGACCAGGA
CAACATATGG TAGGAGCAAT TAGAGAATTG TCTTCATAA
 
Protein sequence
MRKALTILLV ILILPSLLSG LSNIVAFAQS GSGSNNLGTF ISTLQTLIPA ALLLLAILAN 
RYDQRYALVL FAAALASALF LAAATGGKLG GSVNITLVEL KVTVSGPTSA YTGNAETYTV
SWSPSMSGTV IWTVLYNGSI VYNSTGGTSF TYTFNEPGKY IIAAAVINDQ NYAGGSGAVL
ITVTNPPSPL GWLTGAITSA ISGFINTAAN TFEGFLTTLL QIFGAPLEWM TYSPTPNISP
ITQTIYNEIQ DYAIGLAMLF VSFSIAYNAF RGFYSDLVDL AGDVLYKLGV WGLFYAGGMI
VYTYAANFIN SIIYSVAGPY LGIATIEYTG GATLYTALFA LMNGVPFGFG DSLDMFLSLV
MFLLAFTLAV ATIKYAVMLS VVSTIPLWAS LWIFEWTRKI AMMVVDLLIG LMVAGLVAAI
TFAILATLPI GALLFIIDPI AIDGEFLFSL VLFVFSLRPG QHMVGAIREL SS