Gene Information |
Locus tag | Msed_1780 |
Symbol | |
ID | 5104780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1722896 |
End bp | 1723996 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640507678 |
Product | tungstate/molybdate binding protein |
Protein accession | YP_001191859 |
Protein GI | 146304543 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0725] ABC-type molybdate transport system, periplasmic component |
TIGRFAM ID | |
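The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database record.

```python
# Hypothetical helpers (not part of the database record) that re-derive
# the length fields from the coordinates listed in the record.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1722896, 1723996)
print(length)                     # 1101
print(protein_length_aa(length))  # 366
```

Both values agree with the Gene Length (1101 bp) and Protein Length (366 aa) rows of the record.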
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATAGCCT TGAACAAGGG TTTAGTTATT GGTATAGTTG TGTTAGTAAT AATTGTAGTA GGAGTTTTAG CATATGTGGA GATTGGAATG CCGAAACACG TAACTCCCTC GCCAACTACC AACACAACTA CTACGACCAC GACGACTACC ACTACCTCTC AAACCCCGCT CTACATATGG GTAGCTGATG CCTACACGGC AGAGGCCCAA TATCTAGGTT CGTCGTTCCA GAACGCCACA GGCATAACTG TGCCCACTCC CAAGGGAGGC GGATCCTTCG GGCTAGCGAG GGAGATCGCG TCCGAGGGTC CCAACGCCCA GGTTAGCGTC TTCCTTCCTG TGGCCCTCTC AGCTGCATCT CCCTCATACC TGGGAAATTA CTCCTCCGGC TGGGCAATAG CCTTCGTAGC AGACCAGCTC ACCATAGCTT ACACTAACTC CAGCATAAAC AATCCGTACG CACAAGAAGC CCTCAATTAC GCCAAGGAGG CAGAGGCCGG AAACACGTCA GCGTGGTACA ACTTCTTCCA GATACTGAGT AGCGGTAAGG TTAAGGTTGG TATCTCAGAT CCCAACACAG ACCCCGCTGG TTTTAGGGCG TGGATTACCC TAGAGTTAGC TGGATATGAA TATGCCAACA ACACTTTCCT ATTCTACAAT GAAATGCTGA ACAACAAGGG TAACGTGACC GCGTCCAACG CAGCTGAGCT GGTATCGCCG TTGGAGGCGG GACAGATCAA CTTCCTATTC ATTTACAAGT CGGCTGCCAT AGCCAAGGGG TTGGAGTATA TACAACTTCC CAATCAGATT AACCAGGGAG ATCCTAGTTA CTCGAGCCTT TACTCTAAGT TTGAGTACAA TCTATCCACA GGACCGGTTT ACGGTTCCCC CATCTACCTC TTCATCACTG TACCCAAGAA CGCAAATAAC CAGGCCGAGG CTCTTGAGTT CGTGACGTAC GTGATTGAGA ACTCCCAGTC CCTAAGTAAG TTCGGCCTAC TTCCGCTGTC TCCTGCAATC CTGTTCAACT CTACGGCAGT TCCTCCTCAG ATAGCCTCTT TACTCTCCCA GGGTAAACTG GTTGAAGGAG GTACGCTTTG A
|
Protein sequence | MIALNKGLVI GIVVLVIIVV GVLAYVEIGM PKHVTPSPTT NTTTTTTTTT TTSQTPLYIW VADAYTAEAQ YLGSSFQNAT GITVPTPKGG GSFGLAREIA SEGPNAQVSV FLPVALSAAS PSYLGNYSSG WAIAFVADQL TIAYTNSSIN NPYAQEALNY AKEAEAGNTS AWYNFFQILS SGKVKVGISD PNTDPAGFRA WITLELAGYE YANNTFLFYN EMLNNKGNVT ASNAAELVSP LEAGQINFLF IYKSAAIAKG LEYIQLPNQI NQGDPSYSSL YSKFEYNLST GPVYGSPIYL FITVPKNANN QAEALEFVTY VIENSQSLSK FGLLPLSPAI LFNSTAVPPQ IASLLSQGKL VEGGTL
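The gene sequence begins with TTG rather than ATG, while the protein begins with M. Under translation table 11 (the table named in this record), TTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The sketch below illustrates this under stated assumptions: the codon table is built from the standard NCBI amino-acid string for table 11, the sequence is supplied in the space-separated blocks of ten used above, and all function names are hypothetical.

```python
# Sketch: translate a CDS with NCBI translation table 11 and compute GC content.
# All names here are illustrative; none come from the database record.

BASES = "TCAG"
# Amino acids for table 11, in NCBI order (first base slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons allowed by table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks of ten and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base up to, but not including, the first stop."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        residues.append(aa)
    return "".join(residues)


# First 18 bases of the gene sequence above, followed by its TGA stop codon.
print(translate("TTGATAGCCT TGAACAAGTG A"))  # MIALNK
```

The fragment reproduces the first six residues of the protein sequence (MIALNK). Passing the full Gene sequence field to `translate` and `gc_content` would allow the same comparison against the Protein sequence and GC content rows.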