Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1584 |
Symbol | |
ID | 5104029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1531867 |
End bp | 1532817 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507470 |
Product | glycosyltransferase family 28 protein |
Protein accession | YP_001191663 |
Protein GI | 146304347 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0306453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00652045 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGAGGC TGCTCATTAT AGCGAGCGGA GGGGGACATA CAGGATTTGC CAAGGCCATC GCAGAGTATT TACCCTTTAA GGTTGACTTC GTGATTCCTG AGGGCGACGA AAACTCCAGG AAGCTCTTGA GCCCTCACGC TGAGAAGATA TACGAGGTTA GTAAGCCTAG GGATCCGAAG GGCCCAAACA CTTCCTTGGT AACAAGGGGG TTTAGGGCGC TCTCGCAATC AATTTCTCTT CCAAGTTATG ACGTGGTCAT TGCTACTGGC TCGAATCATT CAGTGATACC TTCTTTATTT CAAAGATTGA GGGGTTCCAC TCTCTTCACG CTAGAAAGCC AGGATAGGAT TGTGACCAAG GGAAAGGCAG TGTCCGTGTT ATCGCACTTC TCCAAGGGAG TGTTCCTTCA CTGGAAGGAG CAATCTAGGC TTTATAAGAA CGGGATAGTG GTGGGGCCCA TTCTTCAAAG GAGAAAATAT GATCCAGTGG ACGAGGGTTT CATCCTAGTC ACCGCGGGGA CCGAAGGTTT CAAAGCCCTT TTTGACAGGA TTTCCAGTCT AGGGCTAACA AATCTGGTAA TGCAGACTGG AAAGATATCT CCTGAGCCAT ACGTGAAAAG TGGCGTAAAA GCATTCAGCT TTGATCCAGA TATTGAGAGA TTTATCGCAG GAGCTTCCCT GGTAATAACT CATCAGGGTA AAACAGCCAT GGAGTCAGCC GTGCTCTATG GGAAACCCAC AATCATTGTC TTTAACAAGA GCCTCACCAG GGCTGCTACC CACGAGGATG TGAAACTATA CTCCGAAATA ATAGGTGCAG AATTTCTAGA CGATCCATCC ACTTGGGAAG ACGAGGAATT GCTTAAGGCC ATTCAGAAGC GTAAAAAGCC CAACTATTAT GAACCTGGGA CAGAGAGGTT AGTGAAGGTG ATCATGGATT ATCTTGAATG A
|
Protein sequence | MKRLLIIASG GGHTGFAKAI AEYLPFKVDF VIPEGDENSR KLLSPHAEKI YEVSKPRDPK GPNTSLVTRG FRALSQSISL PSYDVVIATG SNHSVIPSLF QRLRGSTLFT LESQDRIVTK GKAVSVLSHF SKGVFLHWKE QSRLYKNGIV VGPILQRRKY DPVDEGFILV TAGTEGFKAL FDRISSLGLT NLVMQTGKIS PEPYVKSGVK AFSFDPDIER FIAGASLVIT HQGKTAMESA VLYGKPTIIV FNKSLTRAAT HEDVKLYSEI IGAEFLDDPS TWEDEELLKA IQKRKKPNYY EPGTERLVKV IMDYLE
|
| |