Gene Msed_2074 details

Gene Information

Locus tag: Msed_2074
Symbol:
ID: 5105054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1992137
End bp: 1993402
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 46%
IMG OID: 640507964
Product: Glu/Leu/Phe/Val dehydrogenase, C terminal
Protein accession: YP_001192138
Protein GI: 146304822
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0334] Glutamate dehydrogenase/leucine dehydrogenase
TIGRFAM ID:
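
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The record does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1992137
END_BP = 1993402
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 421


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1266
print(expected_protein_length(GENE_LENGTH_BP))  # 421
```

Under these assumptions both computed values agree with the listed Gene Length and Protein Length.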


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.531977
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTCGG TAGAAGAAGT TTTAACTTCG AATTTATATA CACAACAAAC TAAAAAATTA 
TATAAAATAG GGGAGATCCT GGGGCTCCAT GAGGATCAGC TAACTGCTCT TTCCACCCCT
GAAAGGGTGA TTCAAGTAAA GATACAGATT AAGGGAAAAG ACGGCACGAT TAAGACCTTC
ACCGGTTGGA GATCTCAGCA TAATAGTGCG CTGGGTCCTT ACAAGGGAGG AGTTAGATTC
CACCCCAACG TTACGCAGGA CGAAGTTATA GCCCTATCAA TGATAATGAC TTGGAAGAAT
TCGCTTCTCC AGCTTCCCTA CGGAGGAGGT AAGGCTGGAG TTAGGGTAGA TCCTAAGTCC
CTAAGTAAAG AGGAGTTGGA ACAGTTATCT AGGAACTTCA TTGATGCCAT CTACAAGTAT
ATCGGTAGCG ATATAGACGT ACCTGCGCCT GATGTAAACA CGGACTCACA GATAATGTCG
TGGTTCCTTG ACGAATACAC AAAGATTTCA GGTAAGATAG ACCCAGCTAC CTTCACGGGA
AAACCCATAG ATCTAGGAGG ACTTGCAGTA AGGGAGTTCA GTACTGGTCT CGGAGTGGTC
CATACTGCAA AGCTCGCCGC TGAGAAATTT CTGGGGGGAT TAGAGGGTAG GAGGGTAATC
ATCCAAGGAT TTGGTAATCT AGGAAGTTTC GCTGCGAAGT TCTTTGAGGA GAACGGAGCC
ATAGTCATAG GCGTGAGCGA TTCAAAGGGT GGAGTAATAG ATCCCAACGG GCTGAGTTAC
TCGAAGTTAG AAGAGGTGAA AAAGAGCACT GGCTCTGTAG TTAACTATCC GTCGGGAAAG
AAAGTTACCA ATGATGAGCT ACTCATAACC GAAACTGACA TACTTGTTCC CGCAGCACTT
GAGAACGTGA TCCATAAATA CAATGCCCCT AAGATAAAGG CCAAGCTTAT TGTTGAGGGG
GCCAATGGAC CATTAACAGC CGATGCAGAC GCCATCCTTA AGGAGAGAGG AATTCCAGTG
GTTCCCGATA TTTTGGCGAA CTCTGGGGGA GTTGTGGGGA GCTATGTGGA ATGGGCCAAC
AACAGAATGG GTGAGATAAT AAATGAGGAA GACGCCAAGA AACTAATACT CAGCAGGATG
GAGAAGGCGT TTAGCGAGGT GTATATCAAG TACAACTCGC TGAGTGACCA AGACCTGAGG
ACTGCGGCTA TGGTGGTAGC GGTAGAAAGA GTAGTAAGGG CGATGAAGGT TAGAGGGTTA
ATATAA
 
Protein sequence
MTSVEEVLTS NLYTQQTKKL YKIGEILGLH EDQLTALSTP ERVIQVKIQI KGKDGTIKTF 
TGWRSQHNSA LGPYKGGVRF HPNVTQDEVI ALSMIMTWKN SLLQLPYGGG KAGVRVDPKS
LSKEELEQLS RNFIDAIYKY IGSDIDVPAP DVNTDSQIMS WFLDEYTKIS GKIDPATFTG
KPIDLGGLAV REFSTGLGVV HTAKLAAEKF LGGLEGRRVI IQGFGNLGSF AAKFFEENGA
IVIGVSDSKG GVIDPNGLSY SKLEEVKKST GSVVNYPSGK KVTNDELLIT ETDILVPAAL
ENVIHKYNAP KIKAKLIVEG ANGPLTADAD AILKERGIPV VPDILANSGG VVGSYVEWAN
NRMGEIINEE DAKKLILSRM EKAFSEVYIK YNSLSDQDLR TAAMVVAVER VVRAMKVRGL
I
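
The sequences above are printed in blocks of ten residues, so they need to be cleaned before use. The following is a minimal sketch of how the gene sequence could be parsed, its GC fraction computed, and its translation compared with the listed protein. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is built from the standard NCBI amino-acid ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, which does not matter here because this gene starts with ATG.

```python
# Standard genetic code in NCBI order (bases T, C, A, G).
# Translation table 11 shares these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to print blocks of ten."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as printed above.
first_line = "ATGACTTCGG TAGAAGAAGT TTTAACTTCG AATTTATATA CACAACAAAC TAAAAAATTA"
print(translate(first_line))  # MTSVEEVLTSNLYTQQTKKL
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field.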