Gene Msed_2162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2162 
Symbol 
ID5104201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2077742 
End bp2078854 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content47% 
IMG OID640508054 
Productmajor facilitator transporter 
Protein accessionYP_001192225 
Protein GI146304909 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000505649 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGACCGT TTCTTATTTT CGTAACGTCG TCGTCATTCT TCCTAGGATA CTTTGCGAGG 
ATAGCGTGGA GCATCGTGTC CGTGTATTCC ACTCTAAGAC CCACCGAAAT TCAGGATAGT
GTAATATTTT CCCTTTTCTT TCTTGGTTAC GTAATTGTTC AGATCCCATC GGGCATGATA
TCCGACAGGA GACCTAGGGA GGTAGTGATC TTGGCTCTTG TAGGTCTCGC GATCTCCTCC
TTTCTCTCCG GCTTCTCGAC TTCAATCCTT CAGGAATACG TGGCCAGCCT GTTGATGGGT
CTCTCCGCGG GATGGATATA CCCCGTCACC ATAAAGATAT TAGCATCATC CTTTGACAGG
CGAGAGTTAC CTGTGGCAAT AGGCTACTAC AGCCTGGCGT GGCCACTTTC CATTATTCTT
GCAGGCTTAA CCTTACCCTA CCTCTCCATA AACATTGGAT GGAGATACTC ATACTACATG
ATCTCCCTTC TCTGCGTCAT TGTGGCGTTA CTTTACCTGA AGGTGAGGGT TGAAGGGGGA
GGAAATTCAG GAAAGTTTCA GCTAATAAAG GACAGGAACG TTATTGCGGT GAGTATGGCT
GGCTTTTTGT TTTTTCTCTC ATACTGGATA ATAACCCTTT ACGCTTATAA ATACTTCTTG
AAGGTAGGTC TTAACGGATA CGAGGCTGGT ATTGCGTATT CCTTTCTAGC CGTGGCTGGA
ATACCCTCTA CCGTGATTGC CGGTTACTTA ATACGAAGGA TGGGAGTTAG AACTACCTTA
TCAACCTTTG AGGGGTTTTA TGGAGTGTTG ACTATCCTTC TGTCCTTTCT AGTTTCAAGT
GTATCTCTCT TCATTATCTC ATTCCTTATG GGATTCGTGA GATTCGTCAT TACTCCCGCC
AATTCCAGCG CGGTCTCATT GATAGATAAG GGAAGGGCGG GTAGCGTGTC TGGCTTCGCC
AACTTTTTCT GGCAGAGTAG CGGGATCGTG GCTCCATTAC TCGCGTCCCT CGTGGTGATT
CAGCAGGGTT ATCACGTTCT GTGGATAGTA GCGGGGGTCG TAATACTCCT GTCAGCGGTG
CTGTATAGGG TCTTGTTGAG AATAGAGAGG TAG
 
Protein sequence
MRPFLIFVTS SSFFLGYFAR IAWSIVSVYS TLRPTEIQDS VIFSLFFLGY VIVQIPSGMI 
SDRRPREVVI LALVGLAISS FLSGFSTSIL QEYVASLLMG LSAGWIYPVT IKILASSFDR
RELPVAIGYY SLAWPLSIIL AGLTLPYLSI NIGWRYSYYM ISLLCVIVAL LYLKVRVEGG
GNSGKFQLIK DRNVIAVSMA GFLFFLSYWI ITLYAYKYFL KVGLNGYEAG IAYSFLAVAG
IPSTVIAGYL IRRMGVRTTL STFEGFYGVL TILLSFLVSS VSLFIISFLM GFVRFVITPA
NSSAVSLIDK GRAGSVSGFA NFFWQSSGIV APLLASLVVI QQGYHVLWIV AGVVILLSAV
LYRVLLRIER