Gene Information |
Locus tag | Msed_1555 |
Symbol | |
ID | 5104000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1511890 |
End bp | 1513143 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507441 |
Product | hypothetical protein |
Protein accession | YP_001191634 |
Protein GI | 146304318 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.044056 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGCAAC AGGACCCATT GGAAATTTTG CTCAAGGAAG AAAATTTACA AAAAATCACT AAACTAATGG ATGTTCTACC TGCGTTCGAG AAGGTCGCTG AGAAACTAAG CGAGATGGAC AAGAAAGGGG AACTAGACTT CATGTTGGAT ATGCTGGGGC AGGTTGTGAG CATTGCTGAT GCGATGCAGA AGGCTGACCT GATGAATACG TTAGTTTCAT TTGGTATGGA TCAGATAGGG AAGGTTCAGG CTCTGTGGCC TCTCCTGGAG AAGATGACCA GTGATAGGGT AATTAACATT ATCCAGCAAC TCGATATAGA CTCCACCCTA GGAGCTCTTG AGAAGCTCAC CCCAGTACTT AACAAGCTAA CTAGCGATAA GGCGATAAAG GTACTACAAA GTATAGACTA CGACTCCCTC CTGGAGTACA TGGGATCCCT GACACCGCTA CTTAGTAAGC TGACGAGCGA GAAAACCTTG AAGATAATAC AGAGCCTAGA TATGGACGCG CTTCTGAGCG CTGCTGAAAC CATGACACCA ACGCTCGCCA AACTTGCTAA CATGATGAGC GAGATGCAGA AATCCGGCCA ACTGGATAAT CTCATGAACC TAATGCAACA GGGACTTGCT CTACTTGACA CTGTCCAGAA GACCGACCTA ATAAATACCC TGATAGCCTT TGGTATGGAT CAGATAGGGA AGGTTCAGGC TCTGTGGCCT CTCCTGGAGA AGATGACCAG CGAGGATACA ATTAACATGA TTCAGAAGAT GGACATTGAC GGTTTACTCA AGGCTATGAA TTCCTTAATG CCCATGATGC AGAAGCTCAC AAGTGATAGG GCAATTAAGC TGATCCAGCA ACTGGATGTA GAGAGCATGT TGGGAGCCTT TGAGGCGTCA ATGCCCATGT TGAAGAAGCT GACCGACGAG AAAACAGTTA AGGCACTCGC TCAGATGGAC ATGGATTCCA TGATCAACCT AATGATGAAG TTTGCTGAAC TGCAGAGAAC TGGAGTTATG GATAGGATGT ACAAACTCAT GGACGTAATG GCAGATCCAC AACTGGTGGA TACAATGGTT TCAGTCATGG AGAAGTTCGC CAAGGCCATG AAGATATGGG CTAGCGATCT GCCTAACGTG AAACCTGTGG GTATAGGAGG ACTAGCAGGC CTGACAAGGG ATCCAGATTC CAAGTATGCC CTTGGAATAA TGACCTCCCT ACTCAAGGCT ACTGGGAAGG CGTTCAAGGA GTAG
Protein sequence | MSQQDPLEIL LKEENLQKIT KLMDVLPAFE KVAEKLSEMD KKGELDFMLD MLGQVVSIAD AMQKADLMNT LVSFGMDQIG KVQALWPLLE KMTSDRVINI IQQLDIDSTL GALEKLTPVL NKLTSDKAIK VLQSIDYDSL LEYMGSLTPL LSKLTSEKTL KIIQSLDMDA LLSAAETMTP TLAKLANMMS EMQKSGQLDN LMNLMQQGLA LLDTVQKTDL INTLIAFGMD QIGKVQALWP LLEKMTSEDT INMIQKMDID GLLKAMNSLM PMMQKLTSDR AIKLIQQLDV ESMLGAFEAS MPMLKKLTDE KTVKALAQMD MDSMINLMMK FAELQRTGVM DRMYKLMDVM ADPQLVDTMV SVMEKFAKAM KIWASDLPNV KPVGIGGLAG LTRDPDSKYA LGIMTSLLKA TGKAFKE
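The coordinates and lengths reported in this record can be cross-checked with simple arithmetic. The sketch below assumes that start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Msed_1555).
length_bp = gene_length(1511890, 1513143)
print(length_bp)                  # 1254
print(protein_length(length_bp))  # 417
```

Both results agree with the Gene Length (1254 bp) and Protein Length (417 aa) fields listed for this locus.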