Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1117 |
Symbol | |
ID | 5103590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1044353 |
End bp | 1045738 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640507011 |
Product | major facilitator transporter |
Protein accession | YP_001191204 |
Protein GI | 146303888 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.316307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTATCA AGAGTAATCC AGACAAGGAT AGTGAAGGAG TAAAAAAGAC TCATAACCCA TTTGATGCAT TGGATAATAT TCGCTCCAAT AGAAAGATAT TTTTCATTCT ATTGGTAGCT GGAATAGGAG CATTTGCCGA TTATTATAAC ACTGCGGGGA TATTGACACC GTCCAGCTTG AGTTATTTAA AGTATTTCTC TGTGACGACG TCAGTCTATG GCCAATTCGT GTTCTTTCTT AATATAGGTA TATTGATAGA CGCAATAACT TGGGGACCAC TGGTGGACCT CCTAGGCAGA GCTAGAATGT TCTTCGTGGA TTTGGTTGTT ATGCTTGTCT TTGCGACCTT GTCCATATTT GCCACTAACT TCCACGAGTT TGAGCTATTC AGGATACTAA CAGGTTTTGC AATAGGGGGC GATTACGGCG CCGCACTGCC GTTTCTGTCT GAGTTTGCGC CAAAGGCAAT TAGGGGAAGA ATATTGGCGC TGTTTTGGGT ACTGGCGCAA AGCGGACTTG TGGTCGGAAC ACTGGTGAGC TACTACTTTT TGTCAATATC GTCGATCTCA CCAGAAATCT GGAAAATTAT ATTCGTGACA GGCTTGATAC CCATCATTAT CGGAGCAGGA TTAAGGTTTA CTGTCCCAGA GTCTGCTAGA TGGCTTCTAT TCAAGGGAAA AACTGATAAG GCAGTTGAGG CGGTAAGGAA GGTCACGGGG AGTTCACCCC AAGAGACGGT AAATAATATC ACTAGTTCAT TCAATATTCG TAAAAATATA CCTCTGTTAC TTCTCCTCAT TGTCCCAACG TTTATTGGGA TATATGCTAC CGCTATGCCC GCAGGACTAT CCACCTATTT CTTCCCTTAC ATCACGCAGA GTTTGGGATT GTCTAAGCTT ACTTCAGTGC TGTTGCAAGT CCCAGTTGTC TGGGTATCAG AAATCGTATT TACTTTAATT TTAGCTCTTA TTACAGATAA AATTGGTAGA TTAAATTCAC TGATAATAGG AGGTACTGTA TTAATAATTT CTACGTTACT GATAATTCCG CTAACGCATA ATGTGGATGC CCTACTTCCC TTGATATTCA TCTCCAATGG GAGTAGCGTA TTTTCTCAGA CAATCATTAT AAATTGGGGT GCAGAGCTAT ATCCTACAAA CATGAGAGGT ATAGCATCTG GAATTAACAT AATGGCATTT AGGCTATCCC TGGCGTCAAC AGGGTTCATT GAACCTGTGC TGTTATCATT AGGTGGTGTT CCGGCTCTAT ACGCTTTCCT AGGATTATTA TCTCTTATTG GAGTTATGAC CGTAATGATA TTGGTTGGTA AGAGGGGTTC GGTGGAAGGA AAGTCGTTAG AAGAAATAAC TGATAGATTT AGATAA
|
Protein sequence | MVIKSNPDKD SEGVKKTHNP FDALDNIRSN RKIFFILLVA GIGAFADYYN TAGILTPSSL SYLKYFSVTT SVYGQFVFFL NIGILIDAIT WGPLVDLLGR ARMFFVDLVV MLVFATLSIF ATNFHEFELF RILTGFAIGG DYGAALPFLS EFAPKAIRGR ILALFWVLAQ SGLVVGTLVS YYFLSISSIS PEIWKIIFVT GLIPIIIGAG LRFTVPESAR WLLFKGKTDK AVEAVRKVTG SSPQETVNNI TSSFNIRKNI PLLLLLIVPT FIGIYATAMP AGLSTYFFPY ITQSLGLSKL TSVLLQVPVV WVSEIVFTLI LALITDKIGR LNSLIIGGTV LIISTLLIIP LTHNVDALLP LIFISNGSSV FSQTIIINWG AELYPTNMRG IASGINIMAF RLSLASTGFI EPVLLSLGGV PALYAFLGLL SLIGVMTVMI LVGKRGSVEG KSLEEITDRF R
|
| |