Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1156 |
Symbol | |
ID | 5103504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1121624 |
End bp | 1122847 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640507048 |
Product | protein of unknown function DUF395, YeeE/YedE |
Protein accession | YP_001191241 |
Protein GI | 146303925 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.643231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0925546 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAACCT TTACAGCACC CATGTGGGTT GGAATCCTCA TAGGCTTCAT CATAGGCGCA GCAGCGGAGG CCTGGGGGAT AGCAAACCCA GAGACGCTAA TAAGACTAGC GAAGTGGGAG GATAGACTCT TCGTAATATG CATCGCCCTG GGCCTGGCGA TCTCGACTCC CGTACTATTT GGGCTCTATG CCCTCGGGGT GGGTTTCCAC TTCAGCCCGA AGCCACTCTA CCTAGTTGGA GTTGGCCTCG GCGGAGTCCT GTTTGGAGCA GGGCTGGCAA TAGCCGGGTA TTTCCCTGGT TCCATCTGGA TGGCCCTAGG TGAAGGAAGA AGGGACGCAA TCTATGCTTT ACTGGGAGCA CTCCTCGGAG CTGCCTCGTG GACGGCCCTG TACCAGACCA GCGTTGGCCA GTGGCTAGTG AGTACTCTGA ACTTCGGTAG CCTCGTGATC GGTGGAAAGC ACGTGTCTAC CTTCGTGATT CAGCCGTTCC AGGGACTGAC CCCCGTGGAC CTTTTCGGGA TATCCCTAGT TTACGCGGTC GGCCTCTTCC TAGTCGCATA TTACCTTCCG AGGTATAAGG GAGGACAGAG GAGTTGCATT AGGGAGAACC TTGAGAGGAG GAACACTCCC GTCGAGGTCC AGAAGCACCT CGACACAGCA ATCTACTTGA CCGATGGCGG TCTACCCTAC TCCCAGACCT CTCTAGCCAA GAAGGTGAAC GAGTACTACG CAACGGAGAG CAACGTAACC AGGTGGTTCA TGGTCTCCAT CGCCGGTATC GTGGGGCTTA CTGTGGTACT GGAGATGTTC CTTCACCAGA TATTCGGCGA ATCCACAACC TACTCCTGGA TAGTTGGGCA ACTCTTCATG CCATCTTTCA AGTATAGCCA GATAGTCTTC AAGGGGATTG GATGGGAGCC CTTCAGCGAC ATTGGGACCT TGATGGGAGC CTTCTTCAGC GCAGTCTTCA TTACTAGGAG GTTCACATCC TTTAGGAACA TCATACCGCC AAGCTGGGCC CACAGGTTCG GGACAAATGA GGCAGTGAGG TTCGTGGGTT CCTTCCTGGG AGGTTACCTG ATGCTGTTCG GAGCCAGGAT GGCAGGCGGT TGCGCCAGCG GACACATCCT CAGCGGTGAC ATGCAGATGG CCCTGAGTGG TCTCGAGTTC ACAGCAGCCG TTTTTGCAGC AATGATCATA ACTGCGAAGG TGGTGTACAA ATGA
|
Protein sequence | MITFTAPMWV GILIGFIIGA AAEAWGIANP ETLIRLAKWE DRLFVICIAL GLAISTPVLF GLYALGVGFH FSPKPLYLVG VGLGGVLFGA GLAIAGYFPG SIWMALGEGR RDAIYALLGA LLGAASWTAL YQTSVGQWLV STLNFGSLVI GGKHVSTFVI QPFQGLTPVD LFGISLVYAV GLFLVAYYLP RYKGGQRSCI RENLERRNTP VEVQKHLDTA IYLTDGGLPY SQTSLAKKVN EYYATESNVT RWFMVSIAGI VGLTVVLEMF LHQIFGESTT YSWIVGQLFM PSFKYSQIVF KGIGWEPFSD IGTLMGAFFS AVFITRRFTS FRNIIPPSWA HRFGTNEAVR FVGSFLGGYL MLFGARMAGG CASGHILSGD MQMALSGLEF TAAVFAAMII TAKVVYK
|
| |