Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1137 |
Symbol | |
ID | 5103485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1073299 |
End bp | 1074216 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640507029 |
Product | hypothetical protein |
Protein accession | YP_001191222 |
Protein GI | 146303906 |
COG category | [S] Function unknown |
COG ID | [COG5551] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01877] CRISPR-associated endoribonuclease Cas6 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.19081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAGAA ACTTGGCCCT TCACGCCCCA CCCAAGTGCT CTTATTATCC GCGCTTGACA TGTCAATTTA TGCAACTCAT GAAAATGACC TTCAACGTGA CGCCCCTTCA CGACGTGGTT CTGCCACCCC TGTCCTCAAA GGTGTTGAAG TACCTGGTCC TATCTCAACA GGTTCTTCCG TTCCTCGAGG AGCTTGTAAG GTCGAAGGAT AAGCAGAAGC CCCTCTTCAT TTCCAACCTT GCCCTAGATG GGAAGAGGCT TTACTCCAGG GGAGAGCCGA TCACAGTGAA GGCGAAAACA AGGTTAACGG GTTCGGTCAC TTTCCCCTTC TCCAAGGAGG CCTTTAACGT TGGGGGAGGG AGGGTAAAGA CGGTTTACGG TGAGTACGAG ATTTCCCTGA AGGAGGTGTC CGTCCTAGAT GAGACCCCCT CAACCTCGAC AAGAGGGAAT CTCAGGGTCT CCTTCCTGAC TCCAGCCCTC CTTTGCTCCA AGATTTACCT CCCTCCCTTT CTCAGGGAGA AGTACAGGAG GAAGAAGATT GGTTTCTCGC TGATCCCAAC CCCTGGTCTC GTCGTTGCAT ATGGGTATAG GCAGTACCTC GCCCTCCTCG GGAAGACTGA CAGTTACGAG AACGACATAA AGACCTTTAA GCTCCTAGTT ATGGCGAATG CCCTGTCCCG CGTCGTAGGG TATAGGCTTT ACCCCGAGAC GGTTGTGATT GGGGAGGACG AGAAAGGCAG ATTAAGGCTA ACGAGGGGGG TGAAGGGTTG GATAGAGTTT GACATCGTGG GGAAGCTGAA GGAAAGTGCG GCTAAGTACC TTGAGGTTGC CTCCTTCCTA GGGATTGGGA GGAGTAGGGG GATTGGGCTA GGGGAGGTTC ACTTCAAGAT GGTCGAGAGG GGAGAGAATA GTCACTGA
|
Protein sequence | MHRNLALHAP PKCSYYPRLT CQFMQLMKMT FNVTPLHDVV LPPLSSKVLK YLVLSQQVLP FLEELVRSKD KQKPLFISNL ALDGKRLYSR GEPITVKAKT RLTGSVTFPF SKEAFNVGGG RVKTVYGEYE ISLKEVSVLD ETPSTSTRGN LRVSFLTPAL LCSKIYLPPF LREKYRRKKI GFSLIPTPGL VVAYGYRQYL ALLGKTDSYE NDIKTFKLLV MANALSRVVG YRLYPETVVI GEDEKGRLRL TRGVKGWIEF DIVGKLKESA AKYLEVASFL GIGRSRGIGL GEVHFKMVER GENSH
|
| |