Gene Information |
Locus tag | Msed_1590 |
Symbol | |
ID | 5103954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1536899 |
End bp | 1538305 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640507477 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001191669 |
Protein GI | 146304353 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
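The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(1536899, 1538305)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1407 468
```

Under these assumptions the values reproduce the Gene Length (1407 bp) and Protein Length (468 aa) fields of this record.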
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00816719 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0063014 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTTATCGC CAGTAACGAC CAGGGACATT CCTCTCTATT CTCGTGGGTT CTTCATCTTT ATAACAGGTA GTTACGTTAC TTACTGCATG AGAATTGCCA TTAATACTCA AACTCCTCCA ATCCGTTTCA ACTTCACCTA CAAGGAACTC TTGGAGAGAT ATGGAGAAGT TCCAATCCCA CTGGATCTGA GCTCACTTTC TGACCAAGAT TATCAGATCT CAGTGGGCGG AGTAGCTAGG ATGATGATGT CCCTAGTCAA AACCTCAGGA TTTGAGAGGA GCAGATGGGT AGCCTTAGGA CCTGGATACC CTCCTCAGGT TGTGCTGGAA GGAATAGATA TTCACTTCAT CGATTTACCT CCAACTCTTC TATCCAGTTA CACCAGATTC AAGGAAGGAT TATACAATGA AGCTCACGGC ATGTACAACT ATAATATCGT TGGCTCAGAC TACATAGCTT ATGCCTCCTA CAACTGGCAT TCTGCCCTTA AACTCCTGGA ATTCTATTCT GATACAGACG TATATCTGAT AAATGATTTC CAGCAGTTAC TCGTGGGAGG GATAATTGGC CCATCAGCTC CCGCAGTCCT ATGGTACCAT ATTCCCTTTG TGCCAGAGAG GTTAAGTGAT AGGGTCAGAG AGTTCCTAGT TAGGGCATTT GAGGGTTTCG ATCTGATCAT ATCCAGTACA AAACGTGACC TAGAGGGCTT AGTCAGATCA GGTGCCAAGG TGAAAGCTAG ACAAATATAT CCCTTCATAG ATCCCTCAGC TTACTCCCAC GTAAGCAAGA GCGAAGTCGA ATCTGTTTCA GAGAAATTTG GGATCAAAGA GGATGATAAG GTAGTCCTGT TAGTGGGAAG AATGGACCCG ATAAAGAGTC AGGACATAGC GATTAAGGCT ATCAAAAACA CCAACTTGAA GCTGGTAGTG GCAGGGAACG GAAGTTTCAC CAGTAAGAGC CTTGGCCATG ATAAGGCTAG TATTTGGGCA AGAAAGCTAA GGGACTTAGT GGTGGAGCTA GGAGTTCAGG ATAAGGTCAT TTTCACAGGA TATGTGAGCG ACCATGAACT AATGGCTTTA TATCAAAGAA GCGACGTGGT CACTTTGACC TCAAGAAGCG AGGGGTTTGG GTTAACAATA TGTGAAGCGT GGAATTACAA GAAGCCAGTG GTAGTAAGCG AGGGAGCCGG CGTAAGTGAG TTGATCATAA ATGGGGTGAA TGGATACACT TTTTCGCCTG ATAAACCAGA TCAAATGTCA TGGGCTTTGG TGGAAGCGGT AAAGAATTAC GACAAGTTAG GTCAAAGAGG TTATGAGACA TTGCCACAAT GTTCCGTACA AACAGCGTCA GGAAAAGTAA AGGAAGCATT AGAGGAGGCT CAAAGAGGTT ACATTCATAG CAAGTAA
|
Protein sequence | MLSPVTTRDI PLYSRGFFIF ITGSYVTYCM RIAINTQTPP IRFNFTYKEL LERYGEVPIP LDLSSLSDQD YQISVGGVAR MMMSLVKTSG FERSRWVALG PGYPPQVVLE GIDIHFIDLP PTLLSSYTRF KEGLYNEAHG MYNYNIVGSD YIAYASYNWH SALKLLEFYS DTDVYLINDF QQLLVGGIIG PSAPAVLWYH IPFVPERLSD RVREFLVRAF EGFDLIISST KRDLEGLVRS GAKVKARQIY PFIDPSAYSH VSKSEVESVS EKFGIKEDDK VVLLVGRMDP IKSQDIAIKA IKNTNLKLVV AGNGSFTSKS LGHDKASIWA RKLRDLVVEL GVQDKVIFTG YVSDHELMAL YQRSDVVTLT SRSEGFGLTI CEAWNYKKPV VVSEGAGVSE LIINGVNGYT FSPDKPDQMS WALVEAVKNY DKLGQRGYET LPQCSVQTAS GKVKEALEEA QRGYIHSK
|
| |
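The relationship between the gene sequence and the protein sequence can be spot-checked with a short script. This is a minimal sketch, not the portal's own method: it assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the sense-codon assignments of translation table 11, which match the standard genetic code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence shown above.
first_30_nt = "ATGTTATCGC CAGTAACGAC CAGGGACATT"
print(translate(first_30_nt))  # MLSPVTTRDI
```

The ten residues printed match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.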