Gene Msed_0463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0463 
Symbol 
ID5105459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp417905 
End bp419251 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content48% 
IMG OID640506369 
Productmajor facilitator transporter 
Protein accessionYP_001190564 
Protein GI146303248 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0213055 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTTG CAGGAAGAAT AGAAAGGCTT CCCTGGACCT CATTCCACAC CAGGTTGCTG 
GTGTTGTTAA GTCTAGGGGA GTTCTTTGAG CTATACGACC TATTCGTGGG GGGTTTCGTG
GTTGCCCCTA TCTCTGCATT TTATAAGGTA TCCACGGCGG TGGCAATATA CTACAACATT
GCCGTGTTCT TCCTAGGCGC ATTTCTGGGA GCTATAATCT TCACATATGT GGGAGATGCA
CTCGGCAGAA GAACATCGCT AATCCTGAAC ATGGTCATTG CAGGTATTGG TCTATTGCTA
ACCCCGTTTT CACCCAGCAT ACAGGTGCTG GGTACCTTAA GGTTCATTAC AGGCCTAGGA
GTAGGTCCGG AGGCCCTCAT TGTCCTAGAC GTTTTAATCA CGGAGTTCTT TCCTTCGAGG
ATTCGGGGGA GAGCCCTTGC CATAGCGTAT ACCGCGTCCT GGACAGCTCC CATAGTGGTG
GCGATTCTAG CATATCTCCT TATTCCCCAT GTATACCTTA TTCCTGGATG GAAATGGTTA
TTCATTATAG GAGGCCTGGG GATATTCACC ATAATACCCT TCAGGTTCCT TATTCCAGAG
TCCCCAAGAT GGCTAGAGTC CAAGGGAAGA GTTGACGAGG CCGACAAGAT AGTCAGTAAC
ATGGAGAGTA TAGCAATGAG GGAGAAGGGG TCATTGGAGG AGCCTTTACA GGTTCAGGTA
ATCACGTCCC AAAGGGTGAG AATAAGTGAA CTTTTCAGCA ACGAGTATAG GAAAAGGACG
GTAATGTTAT GGATCTTCGA GTTCCTTCAG GCCGGGGTCT ACTACGGTTT CGCCTCACTG
GCTCCATCTG TTCTCGCCAG CAAGGGTTTC ACACTAGTTC ATACGTTAGA GTACTCCATG
TTGATATACA CGTCCTATTT CCTTAGTTCT CTTGCCTCAG TCTTCATCAT TGATAGTCAA
AGATTTGACC GGAAATGGCA AGTGAGTATA GTGATGCTAC TCATGGGGAT AACAGGGTTG
GCCTTTGGGT TCGCTATCAC GCCAGTGGAA GTGGTGGCAA CTGGGTTCCT GTTTGGGTTC
CTTTCAAACA TATTCTCCAA TGCATTCCAT CAATACGGAG CTGAACTTTA CCCCACTAGA
ATGAGGGCCT TCGCTGACGG AGTACAATAC TCCCTTAGTA GGCTGGGTAA CTATGTGTGG
CTCAGCGTGT TGCCTCTGGT TCTGGCCAAG TATGGCGCCG TCGGCATGTA CACAGTGGTG
TTTGTGATGG CTTTGATAGT TGCTCTAGAT GTTGGAGTCC TAGGTCCAAG AGCATCCAGG
ATTGAATTGG AAGAATTGTC CCACTAA
 
Protein sequence
MTVAGRIERL PWTSFHTRLL VLLSLGEFFE LYDLFVGGFV VAPISAFYKV STAVAIYYNI 
AVFFLGAFLG AIIFTYVGDA LGRRTSLILN MVIAGIGLLL TPFSPSIQVL GTLRFITGLG
VGPEALIVLD VLITEFFPSR IRGRALAIAY TASWTAPIVV AILAYLLIPH VYLIPGWKWL
FIIGGLGIFT IIPFRFLIPE SPRWLESKGR VDEADKIVSN MESIAMREKG SLEEPLQVQV
ITSQRVRISE LFSNEYRKRT VMLWIFEFLQ AGVYYGFASL APSVLASKGF TLVHTLEYSM
LIYTSYFLSS LASVFIIDSQ RFDRKWQVSI VMLLMGITGL AFGFAITPVE VVATGFLFGF
LSNIFSNAFH QYGAELYPTR MRAFADGVQY SLSRLGNYVW LSVLPLVLAK YGAVGMYTVV
FVMALIVALD VGVLGPRASR IELEELSH