Gene Msed_0254 details

Gene Information

Locus tag: Msed_0254
Symbol:
ID: 5103874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 213880
End bp: 215199
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 47%
IMG OID: 640506160
Product: major facilitator transporter
Protein accession: YP_001190355
Protein GI: 146303039
COG category:
COG ID:
TIGRFAM ID:
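
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions fit the numbers in this record but are not stated in it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive start/end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(213880, 215199)
print(length, protein_length_aa(length))  # prints "1320 439"
```

Both values agree with the Gene Length and Protein Length fields of the record.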


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.180707
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACCTT TAGATAAAGT AGATAAGGCT GTATGGACCT CAACCCATAG TCTACTCTTT 
GCTTCTCTGG CCTTAGGCTT CTTCATGTGG GGAACAATTA GTACCATAGC TCCCCTTCTA
TACCCCTCTA TAAACAACGT GTTCTTTATC ATAGCACCCA TAGTCGCAAC CCTAGCTGGG
AACTTGATCT TCCCTTTCAT CTCTGACAAG ATGTACGGAA GGAAAAGGAC CTTTATTGTT
ACAATGTCTA TGTACGGTAG TGGAGCGCTC ATTATAGCTA TTGTGTCCCT GGTTTCACAA
TTCACTAAGA TTCCTCTCAC TAGTCCCGCT CTACTTTACA CCCTAACCTT TGGAATAGTC
CTAGGAGTAC TTGGGGTGGA AGGGGAAGTT CCAGTTATGT TGTCGTATGC GGCTGAGATG
ATGCCCATCG TGAGGAGGGA CCAGGTTCTA GTTCTTGCCC CAAACTTTGA CAATATAGGG
GCCATGGTAG CCTCAGCAAT TGTTCTTGTA TCGGCCTCGT CAAGTGCTCC AACTCTAGAG
TTGCTCTCCC TGTCCCTCAC CGCACTAGTA GGTTTAGGTT TTCTCATAGC GGTGAGACTT
AGATTACCCG AGTCCGTTAG ATGGCTATAC GTGAAGGGGT TTAGGGAGAG AGTAGAGGCC
GAACTTTCCA AGTTGGGGAA CAGGATACAA GAGGTCAAAG AGAACCGGAA CGTAAGCAAG
TTGAGCCTGC TCTCTAGATA CTGGTTCTTG GTTGCGATTG CCATATCGCA ATACCTGACC
TACGGCCTCA TGGCCTTCTA CATAGGAGAT TTCTATTTCC CGAGTCTGGA GAATTTCATT
GTGTTTATTG CTAACGTAGG AGCTAGCGTA GCTGGGGTAA TTGCGGGCTT CGCAGTTAAC
AGGGTAAAGA GCAGGAAATT CTCACTTTTC TCGTTCCTGG GAGGGACAGT CACGATCCTG
GGAATACTTC TCACAATCAA CTCTGTCTCC AGTAACATGG GCCTATTTTA CGGCCTCCTT
CTCCTTAACA TGGCCTTTAG TGAGTTCGGC TGGGCTGTGA GAACCATTTA CGAACCCCTA
ATCCTTCCAA GCAGTAATAG GGCCTTCATG ATAGGGCTCG TTAGAGTCTT TCCCATCACT
CTGAGCTCCC TCTCTGTGTA CTTTACGAGT TTTATTAACT CCCCGTTCCT TTACGTGCTA
TATAATACCG CCCTATGGGC CCTAGGAGCC ATTGCGACCA TTACCTGGTA CTTCAAGGGC
TACGACGTAA ACATGACTCC CATAGAAGTA TCGTCCCAAA GCGTTGTGAA AGAGGGTTAA
 
Protein sequence
MEPLDKVDKA VWTSTHSLLF ASLALGFFMW GTISTIAPLL YPSINNVFFI IAPIVATLAG 
NLIFPFISDK MYGRKRTFIV TMSMYGSGAL IIAIVSLVSQ FTKIPLTSPA LLYTLTFGIV
LGVLGVEGEV PVMLSYAAEM MPIVRRDQVL VLAPNFDNIG AMVASAIVLV SASSSAPTLE
LLSLSLTALV GLGFLIAVRL RLPESVRWLY VKGFRERVEA ELSKLGNRIQ EVKENRNVSK
LSLLSRYWFL VAIAISQYLT YGLMAFYIGD FYFPSLENFI VFIANVGASV AGVIAGFAVN
RVKSRKFSLF SFLGGTVTIL GILLTINSVS SNMGLFYGLL LLNMAFSEFG WAVRTIYEPL
ILPSSNRAFM IGLVRVFPIT LSSLSVYFTS FINSPFLYVL YNTALWALGA IATITWYFKG
YDVNMTPIEV SSQSVVKEG
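
The sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It is not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is written out by hand. The amino acid assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in which codons may serve as start codons, and this gene starts with ATG.

```python
from itertools import product

# Codons in TCAG order, first base varying slowest, paired with the
# standard amino acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence above.
print(translate("ATGGAACCTT TAGATAAAGT"))  # prints "MEPLDK"
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.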