Gene Msed_1222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1222 
Symbol 
ID5103836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1196263 
End bp1197663 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content48% 
IMG OID640507114 
Productmajor facilitator transporter 
Protein accessionYP_001191307 
Protein GI146303991 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATA GCTCTAATTT AAGGCAGTCA CCGGAATGGT ACGTGGCCAG AGTGGACAGG 
CTCCCTACAT GGGGCCTATC ATATGCCCTA ATATGGGCCA TGGGATTTTC CTTTTTTATA
ACCCTATATG ACGTCATTAA CGTGGGTTTC GCTCTTCCCT ACGTACCCTT CGTGGTAAGC
GCAGCTCAGG CGTCATTAAT AGCGTCGTTG GGTCTATGGG GTTATGTTGT GGGCGCTCCA
ATCTTTTCAT ACATTGCGGA TGTGGTGGGA AGGAGACCCA CCCTAGTCTT CACTGCCCTG
TTGACGGCGT TAGGAAGTTT CGGGGATGCC CTATCCGTGA ATTATCCCAT GCTGGCCGTG
TTCAGGTTCA TTACGGGGAT GGCCATAGGG GCTGACTTGG TATTGGTCAT GACCTACATG
GCAGAGATGT CCCCTGCCGC TAAAAGAGGT CAATACACGA ACCTTGCCTT TATAGGGGGT
TGGGCTGGAA TAGGAATTGG TCCCTTCATC GCTGCCCTCA TTGTTACCTC TATACCTTCC
ATAGGATGGA GGATAGTTTT CGTTGTGGGA GGGATTCTCG CTGCCCTTGC TCTAGCCATA
AGGGCATATG CTCCTGAGAC GGTGAGGTTC TTGGCCATGA AGGGAAAGTT CAATGAAGCT
GATAGTTTAG TTGGGCACAT GGAGACAACG TCCATGAAGA GAGCTGGCGT AAATCAATTA
CCTGAGCCCA ACATGAAGGT GTACAATGTA CCCAAGGAGA ACCCGTTCAA GGTTCTCGCT
AAGCCGAAGT ATCTTAAGAG GCTCATAATC CTGTTCCTCC TGATGTTTAC CATATACTTT
ATGGATTACC CATTCCTTGT GTTACCAGAA ACATGGGTGA AGGATGTGCT GGGATATAGC
GGGTCCCTGT TCTCCTCCGC TGTCTTCTAT TTTGGGTTAG CCGGGATTGG GGCCTTCCTA
GGAGCTATAC TTCTAAGGTT CATTATTGAC AGATTTGATA GGAGATACAT GACAGTGTTT
GGAGTTGTTG TGTTCACAAT TGGTACTGCC ATAATGGCAA TTGGAGGAAT TGCAAGAAGC
ATTCCGACAT TCTTCATTGG ATCGTTCATT GCCGAGCTCG TGGGAGTTGG ATGGTTCAAC
GTTTATTATC TGCTATGCAG TGAGAACTTT CCAACAAGTG CAAGGGCAAC TGGTTACGCC
ATTACAGACG GTATTGGACA CGCAGGAGGA GCAATTGGAT TGCTCACGGT TTTCCCGCTA
ATCCCGATTC TAGGTAATAT AGGGGCTTGG ACGGTACCGT GGATACCTGC AATAGTGATG
GCGATAGTTA CAGTGTTTAC TCTGCCAAAG ACCGTGAAGG TTAGACTAGA GGAGGTAAAT
GAAGCTACGG ATCGGGTGTG A
 
Protein sequence
MADSSNLRQS PEWYVARVDR LPTWGLSYAL IWAMGFSFFI TLYDVINVGF ALPYVPFVVS 
AAQASLIASL GLWGYVVGAP IFSYIADVVG RRPTLVFTAL LTALGSFGDA LSVNYPMLAV
FRFITGMAIG ADLVLVMTYM AEMSPAAKRG QYTNLAFIGG WAGIGIGPFI AALIVTSIPS
IGWRIVFVVG GILAALALAI RAYAPETVRF LAMKGKFNEA DSLVGHMETT SMKRAGVNQL
PEPNMKVYNV PKENPFKVLA KPKYLKRLII LFLLMFTIYF MDYPFLVLPE TWVKDVLGYS
GSLFSSAVFY FGLAGIGAFL GAILLRFIID RFDRRYMTVF GVVVFTIGTA IMAIGGIARS
IPTFFIGSFI AELVGVGWFN VYYLLCSENF PTSARATGYA ITDGIGHAGG AIGLLTVFPL
IPILGNIGAW TVPWIPAIVM AIVTVFTLPK TVKVRLEEVN EATDRV