Gene Msed_1280 details

Gene Information

Locus tag: Msed_1280
Symbol:
ID: 5104692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1256285
End bp: 1257973
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 53%
IMG OID: 640507170
Product: major facilitator transporter
Protein accession: YP_001191363
Protein GI: 146304047
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
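
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the last codon is a stop codon not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1256285
end_bp = 1257973

# Assumption: coordinates are inclusive, so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length is a whole number of codons and the final codon is a stop.
codon_count, leftover = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 1689, matching "Gene Length"
print(leftover)        # 0, so the span is a whole number of codons
print(protein_length)  # 562, matching "Protein Length"
```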


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTACA AATGGATAGC CTTGAGCAAT ACGACTTTGG GAGTCCTAAT GGCGACGATA 
AACGGCACGA TCACGATCAT CTCGCTTCCC GCCATATTCA GGGGGATAGG GATAAACCCA
CTGGCCCCCT CATCTTTCCA GTACCTTCTC TGGATCCTCA TGGGCTACAA CGTTGTAACC
GCGACGCTTC TCCTTTCCTT TGGAAGGTTG TCTGACATGT ACGGAAGGGT CAGGCTGTAC
AACCTCGGCT TCCTGGTTTT CACCGTGGGC TCTATCCTTC TCTCCTTAAC CTTTGGAAGT
GGGGATATGG CTGGCCTCGA GCTCGTGATA TTCAGGATAA TCCAGGGAAT TGGTGGAGCG
TTCTTGATGG CGAACAGCGC GGCAATCCTC ACAGACGTTT TCCCCGTCAA TGAGAGGGGT
AGGGCACTTG GAATAAACCA GGTAGCTGCG CTGGCAGGAT CGTTAATTGG GCTTATCCTG
GGAGGAATTC TATCGGTCAT AAACTGGAGG TATGTCTTCC TCGTGAGCGT TCCAGTAGGG
GTATTTGGAA CGGCGTGGAG TTACCTGAAG CTCAAGGAGA CCAGTGCAAG GAACAGGGAG
GGGATAGATT GGGTCGGTAA TGCGGTGTTC GGCCTCGGGT TAATCCTTGT ACTAATTGCA
ATGACCTACG CCCTTATGCC CTATGGCTCA GCGCAGACCG GGTGGGGTAA TCCCTTCGTC
ATACTCTCCA TGGTTGCAGG GCTGGGACTT CTGGCCTCTT TTCCCTTTAT CGAGACTAGG
GTCAAGTACC CCATGTTCAG AATGGAACTC TTCAGGAATA GGATGTTCGC TGCGGCCAAC
TTTGCTGGAT TTCTGAGGTC CATAGGCTAC GGAGGTCTCA TGATCATGAT AGTGATCTTC
CTCCAGGGGA TATGGCTCCC GCTTCACGGA TACTCCTACT CTGAAACTCC TTTCTGGGCA
GGGATATACA CGATCCCCCT AATGGTTGGA TTCGTGAGTG CGGGACCAGT GAGCGGTTGG
CTCTCAGACA GGTACGGATC GAGGGGGCTA GCTACTGCGG GAATGGTTCT GGTCGGCATA
GGTTTCTTAG CCCTGACCGC GCTACCCTAC AACTTCAGCT ATCCAGTGTT TGGAGCAATC
ATCTTCATGA TGGGAGTCGG AAATGGGATG TTCGCTTCTC CAAACACTTC CTCGATCATG
AGTAGCGTCC CCGCAAAGCA CAGGGGAGCT GCGTCGGGGA TGAGGTCAAC GCTTCAGAAC
ACTGGACAAA CGGTGAGCAT TGCCATATTC TTCACGATTG TGATCCTTTC CCTGAGTTCG
TCGTTGGGGC CGTCATTGGC TCACGCCCTA GCTCAGGCAG GCGCTCCTCA GCTTTCGCCC
TATGTACAAA AGGTTCCAGT GACCGGGGCC TTGTTCGCGG CCTTCCTTGG ATACGACCCC
GTGAAGGCCC TACTCGGAAC CTTACCCGCT TCAGTTGCCT CCCAGATTCC CAGTAGTGCG
ATCTCGATCA TGGAGCAGAG GACCTGGTTC CCAAGCGCAA TTGCCCCCAG CTTCATGTTA
GCCTTGAGGG AGACGTTCTA CTTTGGGGCT GCGCTCTCCT TCATAGCGGC AGTAGCCTCA
GCCCTGAGGG GAAAAGCTAA AATCCCGGAG GAGGTTGTCC AATATGATGC AGAGAAAACT
AGACGTTGA
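
The GC content reported in the gene information is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as text in the space-separated 10-base blocks shown here. The helper name `gc_fraction` is hypothetical and is not part of any IMG tool.

```python
def gc_fraction(sequence_text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (block separators and line breaks) is ignored, and case
    does not matter. An empty sequence yields 0.0.
    """
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        return 0.0
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


# Small illustrative input, not the gene itself: 4 of 8 bases are G or C.
print(gc_fraction("GGCC AATT"))  # 0.5
```

Passing the full gene sequence text to `gc_fraction` and rounding the result to a whole percentage should give a value comparable to the GC content field.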
 
Protein sequence
MQYKWIALSN TTLGVLMATI NGTITIISLP AIFRGIGINP LAPSSFQYLL WILMGYNVVT 
ATLLLSFGRL SDMYGRVRLY NLGFLVFTVG SILLSLTFGS GDMAGLELVI FRIIQGIGGA
FLMANSAAIL TDVFPVNERG RALGINQVAA LAGSLIGLIL GGILSVINWR YVFLVSVPVG
VFGTAWSYLK LKETSARNRE GIDWVGNAVF GLGLILVLIA MTYALMPYGS AQTGWGNPFV
ILSMVAGLGL LASFPFIETR VKYPMFRMEL FRNRMFAAAN FAGFLRSIGY GGLMIMIVIF
LQGIWLPLHG YSYSETPFWA GIYTIPLMVG FVSAGPVSGW LSDRYGSRGL ATAGMVLVGI
GFLALTALPY NFSYPVFGAI IFMMGVGNGM FASPNTSSIM SSVPAKHRGA ASGMRSTLQN
TGQTVSIAIF FTIVILSLSS SLGPSLAHAL AQAGAPQLSP YVQKVPVTGA LFAAFLGYDP
VKALLGTLPA SVASQIPSSA ISIMEQRTWF PSAIAPSFML ALRETFYFGA ALSFIAAVAS
ALRGKAKIPE EVVQYDAEKT RR
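
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce that mapping under stated assumptions: it builds the codon table from the usual TCAG ordering, which translation table 11 shares with the standard code for codon-to-amino-acid assignments (table 11 differs in which codons may act as start codons, and that is not modelled here). The function name `translate` and the use of `X` for unrecognized codons are illustrative choices.

```python
from itertools import product

# Codon assignments in TCAG order (third base varies fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence_text: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon.

    Whitespace is ignored; codons containing characters other than
    A, C, G, T are reported as "X". Trailing bases that do not form a
    complete codon are skipped.
    """
    bases = "".join(sequence_text.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE.get(bases[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCAGTACA AATGGATAGC CTTGAGCAAT"))  # MQYKWIALSN
```

The printed residues match the first block of the protein sequence, and translating the final nine bases (`AGACGTTGA`) gives `RR` followed by a stop, consistent with the end of the protein.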