Gene Msed_1512 details


Gene Information

Locus tag: Msed_1512
Symbol:
ID: 5104041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1474931
End bp: 1476334
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 45%
IMG OID: 640507400
Product: general substrate transporter
Protein accession: YP_001191593
Protein GI: 146304277
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
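
The length fields in this record agree with its coordinates if the start and end positions are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal check in Python, with variable names chosen only for illustration:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1474931
end_bp = 1476334

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS of this length holds gene_length / 3 codons; the last one is
# the stop codon, which does not appear in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under that assumption the script reproduces the listed 1404 bp and 467 aa.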


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.925564
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.774713
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACCTT TCAAGTCTTT GGATAGTGTG AAGCTTAACT TCAATCACAT CAAGATCTGG 
TACACATCTG GGATGGGTTT CTTCACAGAT GCCTATGATC TTTTTATTAT AGGTGCGATC
CTTGACATAT TCAACGCCTA TCACTTGCCT GGCTTTACCT TAACACCTTT GTATGAAGGT
CTCTTAGCAT CTTCAGCAAT CTTCACTGCA ATAATTGGGC AGTTGGTCTT TGGTGTACTA
GGGGACCTTA TTGGCAGAAA GACTGTTTAT GGTGTGGAAG CCTCTTTACT GACTGCTGGT
GCGGCTTTAT CTGCTTTCGC ACCTAATGTA CTTTGGCTCA TAATTTTCAG ATCTATAATG
GGAATTGGTA TTGGAGGTGA TTATCCCATC TCAGCTACCA TAATGAGTGA GTACGCAAAC
GTAAAGGATA GGGGTAAACT TGTGGCCTTG GTCTTTGCAA ACCAAGGAAT TGGGAGCCTA
GTTGCGGTAG CAGTAGGTGC CATTTCAGCA TTTACCTTGC CTCCAGATCT TGCCTGGAGG
GTAATGGCCT TCGTTGGGGC TATACCGGCA GCTACAGTCA TTTATCTGAG AAGAAAGGTC
CCAGAAACGC CTAGATACTC AGCACTCAAG GGAGATACAA ACAATGTGGA GAAATCTGTT
GAGTTTGTGG CTAAGGATAC ACCCAAGACC GAAGTGAGAA GGGTTAGAAT ACAGAGAAAG
AGCGTATCTG AGTTCTTCTC GAAGTACTGG TTACTCTTGC TTGGAACAGC AGGAACTTGG
TTTATCCTGG ATATAGCCTT CTATGGAACA GGTATTTACT CCGGTCCCAT AGTTTCCTCG
GTACTTGGGA AGCCGGCATC AGTGGGGCAG GAAATAGTGT ACGCAGGCAT TCCATTCATG
GTGGGTTTCT TTGGTTACTT TACTGCAGTT GCACTAATGG ATAAGCTAGG TAGAAAACCC
ATACAGACCT TAGGTTTCGT AATGATGGCA GTGCTTTATG GAGTGGTAGC GTTGCTGGCT
GTAGCTAAGG GGGCTAAATT GGAAGGATTC TTGATTCCTT CTACGCAAGC GTTTGCTCTA
TATGCCCTTT CGTACTTCTT CATTGACTTT GGTCCCAACA CTACAACCTT CGTTATTCCG
TCTGAGGTAT ATCCAACCAG TTATAGGACA ACTGGACACG GTATTTCAGC AGCAGCTGGG
AAGACTGGTG CTGCCATAAC CACCTTCTAC TTCCCTACAC TACTATCCTC ACTAGGAATA
AAGGGCATAT TGGAAATGCT TGCAGTGATA AGCGTCGTGG GTGCAGTTCT CACCTTGATA
GCCGTTAAGG AACCTAAACT CAAGAGTCTT GAGGAGGTTT CCCAGGACTC CGTTGTACTT
GAGCAATCTC AGGAAACTAA ATAA
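
The GC content field can in principle be recomputed from the gene sequence above. The sketch below assumes GC content means the fraction of G and C bases over the whole sequence. The record does not say how its 45% figure was calculated or rounded, and `gc_content` is a hypothetical helper name, not part of any database tool.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace is ignored, so the 10-base blocks used in this record
    can be pasted in unchanged.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence in this record, used as sample input.
sample = "ATGGAACCTT TCAAGTCTTT GGATAGTGTG AAGCTTAACT TCAATCACAT CAAGATCTGG"
print(f"{gc_content(sample):.1%}")
```

Passing the full 1404-base sequence instead of `sample` gives a value to compare with the reported 45%.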
 
Protein sequence
MEPFKSLDSV KLNFNHIKIW YTSGMGFFTD AYDLFIIGAI LDIFNAYHLP GFTLTPLYEG 
LLASSAIFTA IIGQLVFGVL GDLIGRKTVY GVEASLLTAG AALSAFAPNV LWLIIFRSIM
GIGIGGDYPI SATIMSEYAN VKDRGKLVAL VFANQGIGSL VAVAVGAISA FTLPPDLAWR
VMAFVGAIPA ATVIYLRRKV PETPRYSALK GDTNNVEKSV EFVAKDTPKT EVRRVRIQRK
SVSEFFSKYW LLLLGTAGTW FILDIAFYGT GIYSGPIVSS VLGKPASVGQ EIVYAGIPFM
VGFFGYFTAV ALMDKLGRKP IQTLGFVMMA VLYGVVALLA VAKGAKLEGF LIPSTQAFAL
YALSYFFIDF GPNTTTFVIP SEVYPTSYRT TGHGISAAAG KTGAAITTFY FPTLLSSLGI
KGILEMLAVI SVVGAVLTLI AVKEPKLKSL EEVSQDSVVL EQSQETK
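
The protein sequence above can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a hand-built codon table, not the database's own pipeline. It relies on NCBI translation table 11 assigning the same amino acids to codons as the standard code and differing only in which codons may act as start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons in NCBI order (bases T, C, A, G;
# first base varies slowest). '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks of this record can be
    used directly. Trailing bases that do not fill a codon are skipped.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGAACCTT TCAAGTCTTT GGATAGTGTG"))  # MEPFKSLDSV
```

The printed residues match the first ten residues of the protein sequence listed above. Running `translate` on the full gene sequence gives a string to compare with the whole 467-residue protein.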