Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1022 |
Symbol | |
ID | 5104325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 945179 |
End bp | 946396 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640506921 |
Product | hypothetical protein |
Protein accession | YP_001191114 |
Protein GI | 146303798 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.136217 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAATCA GGGACAAACA ACTAATTGAG AAAGTGGGTA AGTATTACCT AGCTTACAGG GGAGATATAG ACCTAGACAG ACTGTTAACC CAGGGCCTAG AGAACATAGA AATTAATGAA ATAACTAAAT ACTTTATTCT GTCATCCATG ACCACATGGA TCGTAGATAG AGCTTTGGCC TACCTCGAGC AGGCCATCGA AGCCAGGCTA ACCTATAGGG AATTTGTAAT AAGTTATGAT CGCGAACCTA TGGGAGCTAT TGATTTACCT AGAAGCATTC CTGTGATGTC AAGGGGAATT TATGCGTACT ATACTTACAT AAAGGGGTAT GATGCGCCGG AGTACGCTAT CATGAATTAT CTCTTGAAGC GTATATATAG TACAGCTTTA CAATACTACA ATAAAATAAA GGATGTTCGA GAGGAAATCA AATACTTTCG CGTGAAGGGT AGGATGAAAA CTAGACTGGA TCGGCTAAGG AAAGGTCTCA GCTACTTTAA GGGAGAATAT TTTAGACCTT TGACAGATTA CGATCCCGAG TGGCTAAGGG AGACTTTCAA TCTCTACTAT ACATTATCTC AGCTAAAGGA ACTTTCCTTG GGCATTTCTA CTCAGAAGGC GCCGTCTATG AATAAGAAAA TGCTTAAAGT GATTTTGTGG AAATTGTATG AGCTTTACGT TTTCTTCATT TTCGTTAAAT ATCTAGAGAG GGAAGGATTC GATGTAGCGA AGGAAAACGG AAGATATGTG GCCAAGAAAG GAAACAGAAG ACTTAGCTTA ATCCTTAATA GCGATCTAGA TTTCTCGCAA CTGGACTCCG TTGATGACTT AGATAATACT GAGATTTTCA GAGGTAGACC TGATCTCTCA TTGGTAGCTG GAAATTCTGT ACTAGTCGAA TGCAAATATT CTAGCAAGGT TGGATATATT ACCTCTAGCA GATTTAAACT CATGGCATAT GCTTATGAAT ACAATCCTCT TACCGCGATA CTTATTTATC CAGGATTAGA TAAGGAGGTT GAGGTCATGG ATTCGGAGGA GAAAGCAACG TACCAGATCA ATGAGAAGGC CAAGGAGGAA GGATTCGTGG ATATTAATTT CAAAAATTCC AAAAAATTAT ATATAGTGGT CCTAAATCCT GCTGATGATG ACGAAACTAA CGAGGAGAAA ATAGCAAGGA TATTTACATC AAATAGTTAC CTAAGCAAGT TATTATGA
|
Protein sequence | MLIRDKQLIE KVGKYYLAYR GDIDLDRLLT QGLENIEINE ITKYFILSSM TTWIVDRALA YLEQAIEARL TYREFVISYD REPMGAIDLP RSIPVMSRGI YAYYTYIKGY DAPEYAIMNY LLKRIYSTAL QYYNKIKDVR EEIKYFRVKG RMKTRLDRLR KGLSYFKGEY FRPLTDYDPE WLRETFNLYY TLSQLKELSL GISTQKAPSM NKKMLKVILW KLYELYVFFI FVKYLEREGF DVAKENGRYV AKKGNRRLSL ILNSDLDFSQ LDSVDDLDNT EIFRGRPDLS LVAGNSVLVE CKYSSKVGYI TSSRFKLMAY AYEYNPLTAI LIYPGLDKEV EVMDSEEKAT YQINEKAKEE GFVDINFKNS KKLYIVVLNP ADDDETNEEK IARIFTSNSY LSKLL
|
| |