Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0936 |
Symbol | |
ID | 5104366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 863889 |
End bp | 865025 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640506839 |
Product | hypothetical protein |
Protein accession | YP_001191032 |
Protein GI | 146303716 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.401072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGTAG GAAGCTCCCT TTTAGTCTTG TTTTTCTTAG CATGTTCATT GACCTGTTCT TCATGTGTTG GTGTACTTAA ATATACGGTA AATTTGGCAA CGTCCACTAT AGAAAATGGT GGGCCCTATA ACACGTTTTC CCAAATTTCT CCCACTCGAA TGGTAGCAGA CAATGAGGGG AGAATATTCG TTGCTGCGGT CAATACGGTA TTCGTGCTAA GTCCCAATGG ATCAGTTATT AAAAACATAA ACGTAAGTGG GGCGGAATAT ATAGCATTTG ACAACAGAAC AAACATAGTA TACGTAACTA GTGGCACGCT TCCAATCCAT ACCATAACAG AAATAAGCGA TGCGAATCTA AGTGTCATTC GACAGCTGAC CGTATCCGCT TATCCCCTTG CCTTAGCTAC TGACCCAACT ACTGGAAAGG TCTTTATCGC GATAGGGAAT GAAGTGTATG CCCTTAACTC CACTAAGTTG GTTCCGTTAT TTAATTTCCT TGGAGTACCT ACAGATATGG TAGTCTCCCC ATCGGGTAAT ATTCTCGTCT CTACTTATAA TTTTACTGAA AATAAAGGAT ATATCTTCAT GAACTTTCAG GGAAGGACCT ATTCCCTCGA GTTGAACACG TTCCCAAACT CTCTCCTTTT AGAGGAGAAT GAAATCCTTG TGGGAACAGA CGGCTACATC TTGGAAATAA ACCTCGCCCT AAATATAGTG GGGAATATAT CGCTTTACGG TGGCAAGATA GAAGGAATGG CCTATGACAG TAATAATGAT CTGGTATATG CTGCCGTGGA TAGCCTTTAC GGCCAAGACT ACGTTTTGGT GCTGAATAAT TTATCACCAA TCGGAGAAAT AAATGTTGGA ATAACTCCCG TAGATGTAGT ATTTGATCCT GTGAGCAATT ACGTTTTTGT AAGTAATTTC TTCGATGGAA CCGTAGCTAT CATCTCGCAG GGGTGTCCCA CTCAGGTAAA CTCGAGCGTA AGTTTACCTA TAATCAGGTC TCCAACGGTA TCTCCACAAA TGCCTGTCAT CTCTTCGTTT CTGCCTTTTT ATGTAACTTT TGTATTGTTA GCCTTGTTTT CTGCTTTAAT CCTGAGAAAA TATATCAGTA AGAATAGAGG AGAATAG
|
Protein sequence | MRVGSSLLVL FFLACSLTCS SCVGVLKYTV NLATSTIENG GPYNTFSQIS PTRMVADNEG RIFVAAVNTV FVLSPNGSVI KNINVSGAEY IAFDNRTNIV YVTSGTLPIH TITEISDANL SVIRQLTVSA YPLALATDPT TGKVFIAIGN EVYALNSTKL VPLFNFLGVP TDMVVSPSGN ILVSTYNFTE NKGYIFMNFQ GRTYSLELNT FPNSLLLEEN EILVGTDGYI LEINLALNIV GNISLYGGKI EGMAYDSNND LVYAAVDSLY GQDYVLVLNN LSPIGEINVG ITPVDVVFDP VSNYVFVSNF FDGTVAIISQ GCPTQVNSSV SLPIIRSPTV SPQMPVISSF LPFYVTFVLL ALFSALILRK YISKNRGE
|
| |