Gene Msed_0467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0467 
Symbol 
ID5105463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp421881 
End bp422960 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content45% 
IMG OID640506373 
Productmajor facilitator transporter 
Protein accessionYP_001190568 
Protein GI146303252 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0594986 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGGA CACTCCTGGC ACTGGCAATG GGTGGTTACA CAGACGGATT TGATCTCCTG 
ATTATTGCAG GAGTACTGGG AGAAGTACTT AAGGTCTTTA AACCTACACG GCTCGAGACT
GGACTCTTAG TTTCAACTTC TTTTCTAGGA TCAATTGTAG GGGCAATTAT CCTTGGGTTA
GTGTGTGACA TCATGGGTAG GAGAAAGAGC TACCTCATCT CGTTAATCCT TTTCATTATA
GGGGCTCTAA TTAGTGCCAC TGCACAGGAT TACGGCTCAC TTGTTCTTGG AAGGTTGTTG
GTAGGTCTAG GAATTGGAGG TGAAATTCCG TCCTCAACCA CCCTTCTAAC GGAGATTTCG
AAAAAATGGG TAGGTCTTAT CTTCGCCTCT TGGGCCATAG GAGCACTCAG TGCGACAATA
ATACCCTTTT TCGTATATCC CTGGAGGATT GCGTTGTTGC TAGGAGCGGT TCCACCGCTT
ATTGCCGTGG CCCTCCACAG GTGGATTAGG GAAAGCGAAG TTTGGCTCAA TTCGTCCAGG
ATTAATAACG CCTCGAAGTT TACAGTTAAT AAAGCCTCAA GTATAGCAAC CTTGGTCACC
GGGTTAAGTC AACTAGTCCT CACCATGGTC TTAGCATCGT TTGCTGTTTA TGTACCTGGA
TACTCCATGG GAACTCAGTT AATCAATTGG ACACTTTTCG CGATAGGTTC AGTCCTGACT
ATCTCGGCAC TCAGGAGAAA GCACATTCTT TTCCTTTCCT ACTTACTAAT AGGGGTGTTC
CTTGTTACAT ATTCCGTCAC CGGCTTGTTC AGTGCAATTG CCTTAGTCTG GTTCTTCTCG
TGTTTAGCCT TCGGATTCTC GTTCTTGTAC GTGGGAGAAC TTAGAAGCGC CACCGATAGA
GGAACCTTCA ATGGATTCCT CTTTTTTATG GGAAGGTTAG GAGGAGCCAT AGGTACGTTC
TCTTATCCAC TTCTGAGATA CGAGTTGAAA GAGGTGCTCC TAATGATTTC CCTTTCCCTT
ATGGTCCTAA GTCTTATCAT ACCATTTTTA GAAGAAACTA GGATAGAAAA GGTAGAGTGA
 
Protein sequence
MKRTLLALAM GGYTDGFDLL IIAGVLGEVL KVFKPTRLET GLLVSTSFLG SIVGAIILGL 
VCDIMGRRKS YLISLILFII GALISATAQD YGSLVLGRLL VGLGIGGEIP SSTTLLTEIS
KKWVGLIFAS WAIGALSATI IPFFVYPWRI ALLLGAVPPL IAVALHRWIR ESEVWLNSSR
INNASKFTVN KASSIATLVT GLSQLVLTMV LASFAVYVPG YSMGTQLINW TLFAIGSVLT
ISALRRKHIL FLSYLLIGVF LVTYSVTGLF SAIALVWFFS CLAFGFSFLY VGELRSATDR
GTFNGFLFFM GRLGGAIGTF SYPLLRYELK EVLLMISLSL MVLSLIIPFL EETRIEKVE