Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1807 |
Symbol | |
ID | 5105370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1748666 |
End bp | 1749916 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640507706 |
Product | hypothetical protein |
Protein accession | YP_001191885 |
Protein GI | 146304569 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTATA ATCTTCTTCC CTTGATTCTT TTATCCCTAC TAGTAGCACC ATTGTTGGCC ATGGGTTCAG CTCCAGCTAT TACAGTAACT ACTCAACCAG TATATCATCC TGGACAGACT GTATTCATTT CAGGAGTTAC TTCTCCTAAC ACTCTTGTGG GCATAACAAT CTATAATCCG CAGGGGAAGG CAGTTTACTC CAACACCACC ACAAGCGGAC CTAATGGAGA TTACTCATTG AAAGCCTTCA CATTCCCGCT ACAGGAGAGC ACAACGTTCC CATTCGGGAC TTATACCGTT CAGGTTGGGA CCCAGACTGG GTTCACCAAC TCCACAACGT TCCAATTCCT ACCTCTAACC GCTACGGTAA ATGTATTAGT GGTTAATCCA CAGGGTGTAC CAATTCAGGG AGCCACTGTT ACTGCAGATA GTGTGACTGC CACTACCAAC GCTTCAGGTC AGGCAGTTTT GAACCTTCCC ACGGGGACAT ATACCTTGAA GGTAGTTCCC CCATCTCCAT ACTCGCCTGC CAGCGAGAAC ATAACAGTAA CAGCACCTAA TACATACTCA TTCAAGATAA CTGTACAGAT CCAGGAACTG GCTCTCCAGG TAGTTAGCGC TACTTCACCC AACGTTAATC TGAAGGATTT GACAAGCGGG ACAAGCATAA CAATGATTGG TGGCACTACA CTCACATTGA TGAGCATGGT AACATTCGCA GGTCAGCCAA TAAGTACGGC TACTGTGACT GCTATGTATA ACGGGACCAT GTATAATGCA ACCTATATGA ACGGTTACTA CGTTATCACC ATATCAGTTC CAAACACGCA GTATGAGACT GACTTAGTAA TACAGGCGAC TTACTCTGGG ATGCAATCCA ACACCGTAAC CCTTCCGTTA ACCGTGAATG TTAATGAACA GGCTATAATA GCAAGTCTTA ACTCGACAAT ACAGTCTCTC GAATCTCAAA TAAGCTCACT AAGTAGCACT GTCTCCACTC TCAGTAGCTC AGTCACAAGT TTGAGCAATA CTGTCTCTAG TTTGAGTAGC ACAGTCTCTA AGCTCAACGG CACTGTGGCA AGTCTACAGA GTTCCGTGTC CACACTGTCT AGCGAATACA GTACTCTAAA TAGCAGGGTT AATGCACTTT CAGGTCTCTC TGGCACAGTG GATATTGCTC TAGCTGTCAG CATTATAGCC ATAATCATCT CCATAGTAGT CCTTATCCTT GTCTTTAGAA AGATAAGTTA A
|
Protein sequence | MKYNLLPLIL LSLLVAPLLA MGSAPAITVT TQPVYHPGQT VFISGVTSPN TLVGITIYNP QGKAVYSNTT TSGPNGDYSL KAFTFPLQES TTFPFGTYTV QVGTQTGFTN STTFQFLPLT ATVNVLVVNP QGVPIQGATV TADSVTATTN ASGQAVLNLP TGTYTLKVVP PSPYSPASEN ITVTAPNTYS FKITVQIQEL ALQVVSATSP NVNLKDLTSG TSITMIGGTT LTLMSMVTFA GQPISTATVT AMYNGTMYNA TYMNGYYVIT ISVPNTQYET DLVIQATYSG MQSNTVTLPL TVNVNEQAII ASLNSTIQSL ESQISSLSST VSTLSSSVTS LSNTVSSLSS TVSKLNGTVA SLQSSVSTLS SEYSTLNSRV NALSGLSGTV DIALAVSIIA IIISIVVLIL VFRKIS
|
| |