Gene Msed_0588 details


Gene Information

Locus tag: Msed_0588
Symbol:
ID: 5105560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 541480
End bp: 542784
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 49%
IMG OID: 640506492
Product: amino acid permease-associated region
Protein accession: YP_001190687
Protein GI: 146303371
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
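
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 541480
END_BP = 542784


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1305, matching "Gene Length"
print(protein_length_aa(length))  # 434, matching "Protein Length"
```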


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.894374
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.71791
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAGCC TATCCAAAAG GCTCAAAATA AGGGAATCCT CGCTACCTGC CTACCTAGTT 
TTCAGTCAGT CCTTGGCATC CATTGCTCCT CTAAGCTCGA CGGCGGCTTA CCTAACAGCT
ACTCTTCTCC TGGCTGGAAC CTCTAGTGGT ATTGCCTCAA TACTGGGCGT GTTGATGTAC
TCCCTTTGGG TCTATGTGGG ATATCAGCTC TCGAGGTTTT TCCCATCTGA GGGAGGGACC
TACACCTTTT CTAGACATAT GTACCCTGAA AGGGTTGCAA CTATTCTGGG ATGGATGTAC
TGGGGGAGTT ACATGTTTTA CCTGATATCT ACCTCAACTT ACGCTACCGG AGTGCTTCTC
CCTCTTCTAG GAGCACCGAT TTCCATAGAC CGGTTAATGG AGGTTGTGCT TCCATCAGCT
ATAGTACTGC TCATGATTAC CGGGATCAGG CCACCGCTTT ACTATAGTCT AGTAACTTCA
CTGGTGGAGA TTGCCGTAAT CGTGGTTCTG GGCATAACGG TGATTGCACA TAGGGGGCTT
TCCCTAGTCC CGTTAACGCC CTCCGCGGGA CTGTCTCAGG TGTTAAGCGG GGCCATGGCC
ACCTCTTTCT CGATAGCTGG TGGTGGTGCT GCCTTCTTCC TAGGGAAGGA GGCCAGAGGA
AAGGGAAAGA CTGTAAGCAA GTCTTACCTG TTGGCATTTC TCCTTGCCTC CGGAGCCATA
GTGTTCTCCT CGATTTACCT CGTAACAGCT GGAGGTTCAA CTCAGGGAGT TGAAAACCTA
GCCAATACTG GCTTTCCCGG TCTTACTGTC GCGAATCAGT ACATGGGAGA ATCCTTTGCC
TCAGCTATGC TACTACTTAC TGTGAATAGT TTAATTGGTT CTCTAATTGC AGCCTACGTG
GCCTTATCTA GACTGACCTT TTCCCTGCTT AGAACTGACT TACCAAAGTC TACCCTTATT
GTGGGTTCCC TCTTCCTGGG AATCAACGGG GTAATAGCTG GGCTGGGGAA TCTAGTACAG
TGGTATCAAT ACTTTTTCCT GGGCTCGTTA ACCGCACTCT TCATCACACA TGCCTCGCTT
TCCCTAGGCC TTCCCAAGAT CAGGAATAAG CTAGCCTTAA GCCTTCTCAA GTCCTTTCCG
GGGATTCTCT CAGCCCTTCT CATGATGGTA GGCCTTTACT CCATTTACCT TGAGGTTGGA
GAGGAACTGG TTGTGGGGAT CTTAGCGTGT ATAGTTCTCG TCATGGTAGG AGTGATTCAG
GGCCTAATAT CCCGAAGAGG GGAGGGAAAG AGTAATATTT CATGA
 
Protein sequence
MASLSKRLKI RESSLPAYLV FSQSLASIAP LSSTAAYLTA TLLLAGTSSG IASILGVLMY 
SLWVYVGYQL SRFFPSEGGT YTFSRHMYPE RVATILGWMY WGSYMFYLIS TSTYATGVLL
PLLGAPISID RLMEVVLPSA IVLLMITGIR PPLYYSLVTS LVEIAVIVVL GITVIAHRGL
SLVPLTPSAG LSQVLSGAMA TSFSIAGGGA AFFLGKEARG KGKTVSKSYL LAFLLASGAI
VFSSIYLVTA GGSTQGVENL ANTGFPGLTV ANQYMGESFA SAMLLLTVNS LIGSLIAAYV
ALSRLTFSLL RTDLPKSTLI VGSLFLGING VIAGLGNLVQ WYQYFFLGSL TALFITHASL
SLGLPKIRNK LALSLLKSFP GILSALLMMV GLYSIYLEVG EELVVGILAC IVLVMVGVIQ
GLISRRGEGK SNIS
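
The sequences above are printed in space-separated blocks of ten, so they need whitespace removed before use. The sketch below shows one way to clean a blocked line and translate it, assuming the standard codon assignments (which translation table 11 shares with table 1 for elongation; alternative start codons are not handled here). The helper names `clean` and `translate` are hypothetical.

```python
# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = clean("ATGGCTAGCC TATCCAAAAG GCTCAAAATA")
print(translate(first_blocks))  # MASLSKRLKI
```

Applied to the full gene sequence, the same two steps can be compared against the cleaned protein sequence as a consistency check.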