Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1512 |
Symbol | |
ID | 5104041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1474931 |
End bp | 1476334 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640507400 |
Product | general substrate transporter |
Protein accession | YP_001191593 |
Protein GI | 146304277 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.925564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.774713 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACCTT TCAAGTCTTT GGATAGTGTG AAGCTTAACT TCAATCACAT CAAGATCTGG TACACATCTG GGATGGGTTT CTTCACAGAT GCCTATGATC TTTTTATTAT AGGTGCGATC CTTGACATAT TCAACGCCTA TCACTTGCCT GGCTTTACCT TAACACCTTT GTATGAAGGT CTCTTAGCAT CTTCAGCAAT CTTCACTGCA ATAATTGGGC AGTTGGTCTT TGGTGTACTA GGGGACCTTA TTGGCAGAAA GACTGTTTAT GGTGTGGAAG CCTCTTTACT GACTGCTGGT GCGGCTTTAT CTGCTTTCGC ACCTAATGTA CTTTGGCTCA TAATTTTCAG ATCTATAATG GGAATTGGTA TTGGAGGTGA TTATCCCATC TCAGCTACCA TAATGAGTGA GTACGCAAAC GTAAAGGATA GGGGTAAACT TGTGGCCTTG GTCTTTGCAA ACCAAGGAAT TGGGAGCCTA GTTGCGGTAG CAGTAGGTGC CATTTCAGCA TTTACCTTGC CTCCAGATCT TGCCTGGAGG GTAATGGCCT TCGTTGGGGC TATACCGGCA GCTACAGTCA TTTATCTGAG AAGAAAGGTC CCAGAAACGC CTAGATACTC AGCACTCAAG GGAGATACAA ACAATGTGGA GAAATCTGTT GAGTTTGTGG CTAAGGATAC ACCCAAGACC GAAGTGAGAA GGGTTAGAAT ACAGAGAAAG AGCGTATCTG AGTTCTTCTC GAAGTACTGG TTACTCTTGC TTGGAACAGC AGGAACTTGG TTTATCCTGG ATATAGCCTT CTATGGAACA GGTATTTACT CCGGTCCCAT AGTTTCCTCG GTACTTGGGA AGCCGGCATC AGTGGGGCAG GAAATAGTGT ACGCAGGCAT TCCATTCATG GTGGGTTTCT TTGGTTACTT TACTGCAGTT GCACTAATGG ATAAGCTAGG TAGAAAACCC ATACAGACCT TAGGTTTCGT AATGATGGCA GTGCTTTATG GAGTGGTAGC GTTGCTGGCT GTAGCTAAGG GGGCTAAATT GGAAGGATTC TTGATTCCTT CTACGCAAGC GTTTGCTCTA TATGCCCTTT CGTACTTCTT CATTGACTTT GGTCCCAACA CTACAACCTT CGTTATTCCG TCTGAGGTAT ATCCAACCAG TTATAGGACA ACTGGACACG GTATTTCAGC AGCAGCTGGG AAGACTGGTG CTGCCATAAC CACCTTCTAC TTCCCTACAC TACTATCCTC ACTAGGAATA AAGGGCATAT TGGAAATGCT TGCAGTGATA AGCGTCGTGG GTGCAGTTCT CACCTTGATA GCCGTTAAGG AACCTAAACT CAAGAGTCTT GAGGAGGTTT CCCAGGACTC CGTTGTACTT GAGCAATCTC AGGAAACTAA ATAA
|
Protein sequence | MEPFKSLDSV KLNFNHIKIW YTSGMGFFTD AYDLFIIGAI LDIFNAYHLP GFTLTPLYEG LLASSAIFTA IIGQLVFGVL GDLIGRKTVY GVEASLLTAG AALSAFAPNV LWLIIFRSIM GIGIGGDYPI SATIMSEYAN VKDRGKLVAL VFANQGIGSL VAVAVGAISA FTLPPDLAWR VMAFVGAIPA ATVIYLRRKV PETPRYSALK GDTNNVEKSV EFVAKDTPKT EVRRVRIQRK SVSEFFSKYW LLLLGTAGTW FILDIAFYGT GIYSGPIVSS VLGKPASVGQ EIVYAGIPFM VGFFGYFTAV ALMDKLGRKP IQTLGFVMMA VLYGVVALLA VAKGAKLEGF LIPSTQAFAL YALSYFFIDF GPNTTTFVIP SEVYPTSYRT TGHGISAAAG KTGAAITTFY FPTLLSSLGI KGILEMLAVI SVVGAVLTLI AVKEPKLKSL EEVSQDSVVL EQSQETK
|
| |