Gene Msed_0255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0255 
Symbol 
ID5103875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp215235 
End bp216422 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content46% 
IMG OID640506161 
Productmajor facilitator transporter 
Protein accessionYP_001190356 
Protein GI146303040 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.299198 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCGT ACCTACACGC AACACTTGCC TCCACAATCG CATGGGCAGG AAATATTTAT 
GACCTGCTAA TTATTACTTA CGTTTATCAA TATATTGAAA GCACATTTCA CATAGGCTAT
GTAATGGTTT CCCTACTCTT TTCCTTTGGA CTATTGGGGA GGGTGGTTGG AGGTACACTC
TTTGGAAGGT TTTCGGACAA GTACGGAAGG AAACCTGTCC TGATATTCAC TACGTTGGGG
TATTCGTTGT CTCATGGGAT TATGGCCTTT TCCCCAAACG TGATAGTACT TTTCCTGGCG
AGACTGTTTG AGGGAGTATT CATGGGAGGA GAGTGGACTG CTGGAACAGT AATAGCCTAT
GAGAGTGCCC CCGTATCAGT CAGGGGAATA CTTACAGGGA TAGTCCAGTC TGGGTATGGA
ATGGGTTACG CGCTCACAGG GGCAATGTAC ATCTACTTTT CACCTCTCAT TTCGGAGGAT
TGGAGAATCT TTCTAGCCAC TGGAACGTTT CCCCTCCTTC TGGTACCTTA CATGAAACTG
AAGGTTCCAG AATCCAAACC CACAAGGGTA TCCAAGGTAA AAGTTGAGTA CAGGGATTAC
CTAAACCTCA TCCTTAAGTC TACCTTGGCA ATGTCAGGGA TGTTTGTAGC CTACTTCTCT
GTTTTCGGAA ATTATCCAAC CTTTGCGGAA AAATTAATTG GAATCTCTCC CTCTACTTTA
GGGTTAACAC TACTGATCTC CAACGTAGGA CTTGCAATAT CCTTTATCGT GTTTGGTCGC
CTTGCAGACA GGATAAACGT AAGGAAACTA ATCCTTTCTG CCCTGGTTAC GCTCACAGTA
TCCCTCTTCT TTACCGTTCC TGGATTCATC AACCTAGGCC CTCTTGCCTC AATCATCTCT
ACGATGGTTT ATGCCTCATC GTGCGGATTC TGGCCCCTGA TACCGCTTCT CCTTGCCCAC
TCCGTTCCCG TGGAGGTTAG GGGGCTCTTG TCGGGAATGT CCTATAACAT AGGCGGGCTT
GTGGGAGGCA TTGCGGAAGT CATCACGGGG ATAGCAATGC AATACATGGG TATCTTGGGA
ATGGCCAAGA TAATCGACAT CATTAATCTG GTTGCTCTCA TCACGGTGTT TATTTCAGTC
ATTACATGGC CAAGGGCAGC CATCCATACT TCGAGCCATA ATGTATAA
 
Protein sequence
MKPYLHATLA STIAWAGNIY DLLIITYVYQ YIESTFHIGY VMVSLLFSFG LLGRVVGGTL 
FGRFSDKYGR KPVLIFTTLG YSLSHGIMAF SPNVIVLFLA RLFEGVFMGG EWTAGTVIAY
ESAPVSVRGI LTGIVQSGYG MGYALTGAMY IYFSPLISED WRIFLATGTF PLLLVPYMKL
KVPESKPTRV SKVKVEYRDY LNLILKSTLA MSGMFVAYFS VFGNYPTFAE KLIGISPSTL
GLTLLISNVG LAISFIVFGR LADRINVRKL ILSALVTLTV SLFFTVPGFI NLGPLASIIS
TMVYASSCGF WPLIPLLLAH SVPVEVRGLL SGMSYNIGGL VGGIAEVITG IAMQYMGILG
MAKIIDIINL VALITVFISV ITWPRAAIHT SSHNV