Gene Msed_1094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1094 
Symbol 
ID5103568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1020145 
End bp1021428 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content52% 
IMG OID640506989 
Productmajor facilitator transporter 
Protein accessionYP_001191182 
Protein GI146303866 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCAGTC CTCCTGGAAA ACTAAAATCC TTCTTCATCT CCTCCGCAGG GTTCCTCCTT 
GACGGATATG ATCTCTCGGT GATATCCTTC GCGCTTCTGT TCCTTCCGAA GGAACTTCAT
CTTACCCCAT TACAGGAGGG ACTCGTTAGC TCTGCCTCGC TCATGGGAAT GATACTCGGT
TCAGTCCTAC TCGGGTTACT CTCCGACAAG ATGGGGAGGA AAAGGCTCAT GGGCTTGGAT
CTAGTAATCT TCACGGTCTT CGCCATAACC TCGGCCCTGT CCCAGAACTT CCTGGAAATG
TTCCTATCTA GGCTACTCCT GGGGGTTGGC ATAGGTGGAG ATTATCCCCT AAGTAGTTCC
CTCATGGCTG AGTACTCCCC CTCAAGGTCG AGGGGAAGGT ACCTCGTGGG GGCAGTGTCC
ATGTATTGGG TAGGAACACT GCTCTCTGCT GTCGTGAACC TAGTCTTCCT TCCCACGGGT
GACTATTTCT GGAGGTATTC CTTCGCGTTT GGAGCCCTTC TATCCATCCC AGTCATAGTA
GCCAGGTTCT CTCTCCCCGA GTCACCCAGA TGGTTAATAA GCAAGGGTAA ACTTAAGGGA
GATGGAATCC CAACCCAAGA GGAGGAAAAC AAGGGAGTTA CAGGTTTCCT TGACCTGTTC
AGGATGAGAT TACTTCCATA CCTCCTCCTA GTCTCAGCAA TCTGGTTCTT GTTTGACGTT
GCGTCATACG GTATAGGACT TTACTACCCA GCAATATTTA GGGAGTTCTC TTTACCCTCC
AACTACGAGG TGATTTACGC CACCATGATA ATCGCGGTGG GAGCAATCCT CGGCTATATC
CTGGCGGAGG TCGCCATAGA TTCGCTGGGA AGGAGAGCTG TTCTTCTATC CGGGCTTGGC
GTAATGGCAC TTCTCCTGGC TGTGGGAGGT GTCCTGAGGC TTACCGGGGT TGTTTTGGTG
CCATACTTTG CGGTCTTCGT GGCAATGGAG CAGTGGGCTG GCGCGGTCAC ACTCTTTTAC
CCCGCTGAGC TCTTCCCTAC CCCAGTTAGG TCATCCGCTC AAGGATTTGC GACAGCAGTG
AGCAGGATAG GAGCTGTCCT GGGAGTCGTG TTTTTCCCTA GCATGGTGAA GGTCCTTGGT
CTCTCTAACT CCCTGATTCT GTTCTCTGTA ACGTCGGCTA TCGCATTCAT ATTGGCACTC
CTGCTGAGGG AAACTAAGAG AAAGGAACTA GAGGAGATCT CCCTTGGGCT AAAGGAGGTG
AAAGGGAGAA ATCCGAGTAC ATGA
 
Protein sequence
MSSPPGKLKS FFISSAGFLL DGYDLSVISF ALLFLPKELH LTPLQEGLVS SASLMGMILG 
SVLLGLLSDK MGRKRLMGLD LVIFTVFAIT SALSQNFLEM FLSRLLLGVG IGGDYPLSSS
LMAEYSPSRS RGRYLVGAVS MYWVGTLLSA VVNLVFLPTG DYFWRYSFAF GALLSIPVIV
ARFSLPESPR WLISKGKLKG DGIPTQEEEN KGVTGFLDLF RMRLLPYLLL VSAIWFLFDV
ASYGIGLYYP AIFREFSLPS NYEVIYATMI IAVGAILGYI LAEVAIDSLG RRAVLLSGLG
VMALLLAVGG VLRLTGVVLV PYFAVFVAME QWAGAVTLFY PAELFPTPVR SSAQGFATAV
SRIGAVLGVV FFPSMVKVLG LSNSLILFSV TSAIAFILAL LLRETKRKEL EEISLGLKEV
KGRNPST