Gene Msed_0407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0407 
Symbol 
ID5105524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp359062 
End bp360345 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content48% 
IMG OID640506313 
Productmajor facilitator transporter 
Protein accessionYP_001190508 
Protein GI146303192 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATC AACCCTTGAG GAGCATATCC TCGTCCAAGA GGATAGTTAG GTTGTTGCCC 
ATTCTTTTTT ACCTCTATCT AGTAAATTTT CTAGATAGAG TTAACATATC CTATGCAATT
TCAGCGGGGA TGTTCAAGGA TTTGGGAGTT CCCAAGAGTA GCGCGGATCT TATAGCCTCC
ATTGCCTCTA GTCTATTCTT CGTAGCTTAC GCTATCCCTC AGGTATTCTC CAACCTAGGC
ATAAGCAGAA TTGGAGTTAG GAAGGTATTT GCGTTAGCCT TCACCGCATG GGGGATAATC
ACAATTCTCA CAGGGTTTGT TCAGAACGTT CCTGAAGTCT ACCTGCTTAG GTTCCTCCTT
GGACTCGCTG AAGCTCCTTT CTACGCGGGC GTAATCTTTT ACCTCAGCGT GTGGTTCCTG
AGGGACGAAA GGGGATTCGC AAATAGCCTG TTCAATGCAG CCATCCCTGT CTCAGGGATA
ATAGGAGGAC TCATAGCTGG TTCATTCTTC TCTGTGTTTG GAGATGATCC CGGATGGAGA
TACCTATTCG TGGCTGAGGG TGTACTGGCT CTCGTGTCGG TGGCTGTTAT CTGGCTTTTA
CTCACCGACT TTCCCAAGGA TGCAAAGTGG TTAAGTGAGG GGGAGAAAGA GGAACTTCTA
AGCAAGATAA AGGTTGAAAA GGAGGAGAAG CAGAAGCTAG TTTCCCACGC CTCGTGGAGG
AGGGCGCTAG GTGATAGGGA TGTACTTCTC CTGGTGCTGA TATATTTCCT TGGCGTAACG
TCACTGTACG GTTACACCAT CTGGTTGCCG TCAATCATTA AGAGCTTCGG CGTCTCCGCC
TCAACTGCAA GTTACCTCAC TGTTATACCA TATCTCGTTG CCTCAATCTC GCTCATCTTC
ATCTCCAGGT ATTCAGACAG GGCCGGAGTT AGGAAATCTC TGGCCTTGGC AATATTTCTC
GTTGCAGGGA TTGGGCTATC CTTAAGTGCA TTTACACTCA AGACGCCAGT GATTTCGTTC
CTATTCTTCG TAATCTCTGC TATTGGAATT TACAGTTTCA TTCCAGTATT CTGGACTATA
CCCACTGAAT TCCTAAGCGA GGAGTCAGCT GCAGCGTCCA TAGGACTAAT AAACGCACTG
GGCAACTTGG GTGGGATCGC TGGTCCCATC ATAGTAGGCT TCCTAGAGAG CTTAACGGGG
GTTTTCACGG CAGGTGTTTA CTCCCTCGCC CTCTTCGACA TCCTAGCAGG GCTTGTGGTA
TTACTAGTCA GAAAGAGCAG ATGA
 
Protein sequence
MSNQPLRSIS SSKRIVRLLP ILFYLYLVNF LDRVNISYAI SAGMFKDLGV PKSSADLIAS 
IASSLFFVAY AIPQVFSNLG ISRIGVRKVF ALAFTAWGII TILTGFVQNV PEVYLLRFLL
GLAEAPFYAG VIFYLSVWFL RDERGFANSL FNAAIPVSGI IGGLIAGSFF SVFGDDPGWR
YLFVAEGVLA LVSVAVIWLL LTDFPKDAKW LSEGEKEELL SKIKVEKEEK QKLVSHASWR
RALGDRDVLL LVLIYFLGVT SLYGYTIWLP SIIKSFGVSA STASYLTVIP YLVASISLIF
ISRYSDRAGV RKSLALAIFL VAGIGLSLSA FTLKTPVISF LFFVISAIGI YSFIPVFWTI
PTEFLSEESA AASIGLINAL GNLGGIAGPI IVGFLESLTG VFTAGVYSLA LFDILAGLVV
LLVRKSR