Gene Msed_0993 details

Gene Information

Locus tag: Msed_0993
Symbol:
ID: 5104542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 916253
End bp: 917746
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 50%
IMG OID: 640506892
Product: major facilitator transporter
Protein accession: YP_001191085
Protein GI: 146303769
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
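
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(916253, 917746)
print(length_bp)                  # 1494
print(protein_length(length_bp))  # 497
```

Both values match the Gene Length and Protein Length fields of this record.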


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000427964
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0333525
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAATT CTCCAATTAT GGCAGAAAAG TCTGTGAAAG CCGGGGAAAT AATCGCCCGG 
ATGGACAGGT TACCCATCTG GTCACTCTCG TACATTTTCA TTGGAATACT GGGGATGGGA
TTTCTCTTCA CATTTTTTGA TATTTTTGAC ATTAACGTCT CATTTATCCA GACATCTCTC
ACCATATTTC ACGTTAGTAG TCCATCCTCG CCAGAGATTG GGGTTCTACT GGGACCGGCA
GTTCTCCTTA ACCTGGTTGG ATATATTGTG GGCTCCCTAC TCCTTTCCCC TCTCTCGGAT
CGGATAGGCA GGAGGAACAT GTTAATGATA ACCATGGCCA TCACGGGGCT CGGGAGTCTG
TACAACGCAC TTGTTAACGA TTATTCTAAC TTCCTCCTGG CTAGGACTAT CACAGGAATA
GGAGTCGGAG CAGACCTAGC AGTGGTCAAC ACTTACATAG GCGAAGTTGC TCCACTAAAT
GGGAGGGCAA AGTACACCAG TTTCGTGTTC CTGTTCTCAA CTCTTGGGGC TGGGTTGGGT
CTCTGGTTAG GACTCCTGTT AACAACTCCA CCTGCTCCAT TTCCCCTAGG TTTACCCTTC
GCTCTAGGAG GATCTGGCTT CCTTGCCGTA AACGGGTGGA GGGTGATGTA CGGAATTGGC
GCTCTCCTAG CCTTGATAGG CTTGCTCCTC AGGTTCAATC TTCCTGAATC TCCTAGATGG
TTGATATCCC GCGGTAGGAT AGCTGACGCC GAGGCCGTGG TAAAACAAAT GGAAGAGAGA
GCGTCGAGGA AGCTAAGGTC TCTTCCTCCA CTTCCCGCGG TAATACCCCC TTACGTTGTG
GAGAGATTGT CCTATCTCGA CTCGTTAAAG GCAGTGATAC TGGATAGGAG GTATGCGAGA
AGGCTCGCGG TCCTAATCCC GATGTGGTTC TTTGGCTACA TGACAGTTTA CGTGTTAGCA
GCAGGATTAA CCACTATCCT GGCGTCCCTA GGATATCCTC CGCCCGAGGC CGGTATCATT
GCCTCCTTTG GGGATATAGG GTTCATCCTA TGCGCAGTAA CCATCATGTT GGTTGGGGAT
AAGATGGAGA GGAGCAGGTG GACTGCAATC TCAGTTCTCT TTACCATAGT GGGGGGCGTG
GTGATAGCGT TAGCGAAGAC CAACTTACCG TTATCTTTCC TGGGATCGTC AATACTGTTC
TACGGCTTCA ATCTATGGGT TCCAGTGTCA TATGCGTGGA GCGCTGAGAG TTTTCCAACA
AGGGCTAGGG CGACAGGTTT CGCCCTCACC GACGGGCTGG GGCATATAGG AGGAGGAGTA
GGGACAGTTG TTGTAGCGTC TTTTGTGGCG TCCCTAGTGT CCAGTGGTGT CACCACGGGA
TTGGCAATTG AGGTTTTCCT GCTCATAGCG TCCTTCCAGA TAATCTCAAC AGTGATTGCA
GTATCCCTAG GACATAAAAC AGCTAATAAA AGGTTGGACG AAATATCTCC GTGA
 
Protein sequence
MNNSPIMAEK SVKAGEIIAR MDRLPIWSLS YIFIGILGMG FLFTFFDIFD INVSFIQTSL 
TIFHVSSPSS PEIGVLLGPA VLLNLVGYIV GSLLLSPLSD RIGRRNMLMI TMAITGLGSL
YNALVNDYSN FLLARTITGI GVGADLAVVN TYIGEVAPLN GRAKYTSFVF LFSTLGAGLG
LWLGLLLTTP PAPFPLGLPF ALGGSGFLAV NGWRVMYGIG ALLALIGLLL RFNLPESPRW
LISRGRIADA EAVVKQMEER ASRKLRSLPP LPAVIPPYVV ERLSYLDSLK AVILDRRYAR
RLAVLIPMWF FGYMTVYVLA AGLTTILASL GYPPPEAGII ASFGDIGFIL CAVTIMLVGD
KMERSRWTAI SVLFTIVGGV VIALAKTNLP LSFLGSSILF YGFNLWVPVS YAWSAESFPT
RARATGFALT DGLGHIGGGV GTVVVASFVA SLVSSGVTTG LAIEVFLLIA SFQIISTVIA
VSLGHKTANK RLDEISP
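
The relationship between the two sequences above can be illustrated with a short Python sketch that strips the 10-base block spacing, translates the nucleotides, and computes GC content. It assumes the standard amino-acid assignments, which translation table 11 shares with table 1 for internal codons; the alternative start codons that table 11 allows are not handled, which does not matter here because the gene starts with ATG. The helper names (`clean`, `translate`, `gc_content`) are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGAACAATT CTCCAATTAT GGCAGAAAAG"
print(translate(first_bases))  # MNNSPIMAEK
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.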