Gene Msed_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2149 
Symbol 
ID5104888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2064015 
End bp2065292 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content47% 
IMG OID640508040 
ProductAIR synthase-like protein 
Protein accessionYP_001192212 
Protein GI146304896 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1973] Hydrogenase maturation factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000105335 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000422525 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATCTAG AGGGATATGC AAGGAGATTG TGGAATCACC TCGACGAGTC TCAAATGAGG 
GAAGAACTAC TTCGCTGGTT AGAATTCTAT AAGGGAAAAA GGGAGCTTAA TCAGGATTTT
GTGGACGCCG TAATAAGGGA GGTCAAGAAT TCAGAGAATT TTAAGGAATT CTCCTTCACG
AGGGTAGGCC TCACGGCAGG AGACAGTGGC CTAGGTTCCA GGGGTCTTGG GGATAATCTA
ATTCATCTGA AATTATTTGA GCTCAGTAAG AGGAATCTCG AGACTTTTGA TGATGCGGGA
ATAGTTCAGG ACATAGTTGT CTCTGTGGAC GGAATACACT CCAGACTATC CTACTTCCCT
TTTCTAGCTG GATTTCATGC AACAAAGGCC ACCCTTAGGG ATATCATGGT GAAAGGTGCA
ATCCCGCTGG GCATCCTTGT GGATATTCAC CTTTCCGACG ATAGCGATGT TTCCATGCTC
TTCGACTTTG AAGCCGGTGT ATCAACTGTG GCTGATGCCT TGAACGTACC AATTCTGGCT
GGTAGCACGC TGAGGATTGG TGGTGATATG GTCCTGGGAG AGAGGATAAG TGGGGGCGTT
GCCTCTGTGG GGAGACTCCA GGGAGAACCA TTCACTAGAA AGAGAATTAG TGAGGGACAA
CATATAGTCA TGACAGAAGG CCATGGAGGT GGAACAATCT CGTCCATGGC CATATTTCAT
GGCATTGAGG GTGTAGTGGA GGAGACCCTA AGGGTAAAGG ATCTTGAGGC ATGTCTTGCC
GTGAGACGTG TTAGAAATCT CGTAAGCTCC ATGACAGACG TTACTAACGG TGGTATAAGG
GGTGATGCGT TAGAGATTTC GGAGGTAACT AACGTAAGCC TTGTGATAGA CGAGGATGAA
TTCCTCTCTC TCATAAACCC AAGGATCAGG AAGGCCATGA ATGAATTGGG CATAGACCCC
TTTGGTCTCT CGCTTGATTC CATCCTTATT TTCACCAATA ACCCGGACGA GGTTATAAGG
ACCTTGAGGG ATAATCACGT ACAAGCTAAG ACCATAGGGG AGGTCACGCG AAGGAGAGGA
TATCCAATAG TTACCCGCGA TGGAAGGGAG ATGAGACCCG CCTTTAGGGA AAGCCCCTAC
ACTCCCATTA AAGCCGTCAT AGGAAACTAC TCCCCCATGG ATCTAGATGA GATTAAAAAG
AGACTGGAAA GGGCCTACCT GAACTCTTTG TCGAAGAAGG AAAAGGTATT GAAAAACTTA
AAAACAGGGA GTTTATAG
 
Protein sequence
MDLEGYARRL WNHLDESQMR EELLRWLEFY KGKRELNQDF VDAVIREVKN SENFKEFSFT 
RVGLTAGDSG LGSRGLGDNL IHLKLFELSK RNLETFDDAG IVQDIVVSVD GIHSRLSYFP
FLAGFHATKA TLRDIMVKGA IPLGILVDIH LSDDSDVSML FDFEAGVSTV ADALNVPILA
GSTLRIGGDM VLGERISGGV ASVGRLQGEP FTRKRISEGQ HIVMTEGHGG GTISSMAIFH
GIEGVVEETL RVKDLEACLA VRRVRNLVSS MTDVTNGGIR GDALEISEVT NVSLVIDEDE
FLSLINPRIR KAMNELGIDP FGLSLDSILI FTNNPDEVIR TLRDNHVQAK TIGEVTRRRG
YPIVTRDGRE MRPAFRESPY TPIKAVIGNY SPMDLDEIKK RLERAYLNSL SKKEKVLKNL
KTGSL