Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0846 |
Symbol | |
ID | 5105206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 777321 |
End bp | 778649 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506751 |
Product | general substrate transporter |
Protein accession | YP_001190944 |
Protein GI | 146303628 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCTA CAAACCCTTT AAGGGATTAC GAGGAAAAGA AGCTCGGAAT TTTTCATCTT AAGACTACCC TTCTCGCAGG GGCGGGACAG ATAGTTGATG GTTACGATCT AACCGCAGCT GCTCTGGTGC TCTCCCTCGT GGAGGCCTCG TTCACGGGTT ACAATCTTGC AGAGGTTTCT CTCATTCTTT TCCTTAGTAT AACCCTTGGA AACTTAGTTG GAGGACTAGT TTTTGGGTAT CTGGTTAAGC ATGGAAGGAA AAGGTTTTAC GGGATAGATG CCACTCTCAT GACACTCGGG GCCCTCCTCC AGGCATTCGT TCAAGACCCC TATCAGCTGG CGATTCTGAG GTTCCTCCTA GGGGTAGGAA TAGGTGCAGA TTACGTTCTA TCTCCTCTGA TTAACGCGGA GTATGCCAAC AGAAAGGATA GGGGTAAGCT CCTGGCACTG TCGGGCGGTT TCATGTGGAA CGTTGGGGCC CTAGTTTCGG TCGTGGTTAC CTTAGCCGTG TCACAGGCCG TTCCGCAGGA CATGTTGTGG AGAATAGTCC TAGCCTCAGG TGCAATTCCG GCCATAGCTG TCATATACGG GAGGAGGAAG TTCCCTGAGA CACCGCAATA CCTCGCTTTC GTAAAGGGAG ATTCCAAGGA ACTTGAGGAG AAGTACAACC TATATGCTAG CAATCTCAGC CTAGGAAAGG TGGCCATCAA GGCCTTCCTC CCAACCTTGA TATTTGCCTC AGTGACATGG TATCTCTTCG ACGTTTCTGC GTATTCAGGA GTTTTCTTTG GACCAAGTGT TATAGCCAAG GATCTAGGTA TTAATGGCCT CCTGTTCGAG CTAATTATCC TAGGAGGCTT CGCAGTTCCG TGGAACCTGG TGAGTGCAGG CCTAAACGAC AGACTTGGAA GGAGAGCCCT GCAGGCAATA GGTTTCGCAG GAATGGGAAC GTTCACTCTC CTCTTTGCTT TCCTCTTCGG CAGAACTCAG GCATTAGAGT CGCTTCTCCT CTACGGTTTT AGCACGGTCT TTTCTCAGCT GGGACCTGGT ACTGTGGTCG GATTCTGGGG AGTTGAACTA TTCCCAGCTG AGATAAGGGG TATAACCCAG GGGGTCACGG TCATGTCCGG AAGGCTCGGG GTGCTAACAA CAACCTTCCT ATTCCCGCTT ATCATTTCCA GTTACGGGAT AGTTACCACG ATGATGATTT TAGCAGGTCT ATCCTTCGTG GCGGTGTTTG CGACCCTGCT ATTGCCTGAA CCCAACCAGG TTAGCCTCGC AGAGAGGGAG CTTCAGCTTA GGGGTTTGCC CAACCTGGAG GAGAAGTAA
|
Protein sequence | MNPTNPLRDY EEKKLGIFHL KTTLLAGAGQ IVDGYDLTAA ALVLSLVEAS FTGYNLAEVS LILFLSITLG NLVGGLVFGY LVKHGRKRFY GIDATLMTLG ALLQAFVQDP YQLAILRFLL GVGIGADYVL SPLINAEYAN RKDRGKLLAL SGGFMWNVGA LVSVVVTLAV SQAVPQDMLW RIVLASGAIP AIAVIYGRRK FPETPQYLAF VKGDSKELEE KYNLYASNLS LGKVAIKAFL PTLIFASVTW YLFDVSAYSG VFFGPSVIAK DLGINGLLFE LIILGGFAVP WNLVSAGLND RLGRRALQAI GFAGMGTFTL LFAFLFGRTQ ALESLLLYGF STVFSQLGPG TVVGFWGVEL FPAEIRGITQ GVTVMSGRLG VLTTTFLFPL IISSYGIVTT MMILAGLSFV AVFATLLLPE PNQVSLAERE LQLRGLPNLE EK
|
| |