Gene Information |
Locus tag | Msed_1968 |
Symbol | |
ID | 5103355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1905118 |
End bp | 1906128 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507856 |
Product | phosphate uptake regulator, PhoU |
Protein accession | YP_001192032 |
Protein GI | 146304716 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0704] Phosphate uptake regulator |
TIGRFAM ID | |
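
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1905118
end_bp = 1906128
gene_length_bp = 1011
protein_length_aa = 336

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values (1011 bp, 337 codons, 336 aa).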
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAGCA GATCGTCAAG AAGGATCCAA TTAACAGGGG GATCCACTTA TATCATCTCC TTACCTAAGT CCTGGGTAAG ACAGCTATCT TTAAACCCAG GGGATGAAGT TGAGGTAATT CAGGACAACA ACTTTAGGCT TCTCCTAGTC CCTAAGGGGA TTCCACAGGA CACCAAGCAG AACAGGGCAA CCATTACATG TGAAAATTTG AGGCCAACCT TCGCAGTTAG GGAGTTCATC GCGTATTACA TGGCTGGTTT CACAATAGTC TCACTGATAT GCCCCAAGAT GAAGGCTGAG GATAGGGCCA TGGTAAAGGA CAGCGTAAGG AAAAGATTGC TTGGGGCTGA GGTTATAGAG GAGGACAACT CAAACCTGAC TGTTCAGTTC CTGGTTAACG AAAAGGATCT GCCAATCTCG AGGGCCATAA ACAGGGCAGC CGTGATCACC CAGAACATGT TAAAGGATAC TCTTGACGCC CTGAGGAATA ACGACGCGGA GATGGCCAAG GAGGTCCAGG AGAGAGACGA CGAGGTGGAT AGGTTCTACT TTTACGTAGC TAGACAACTC ACTCTAAGCA TAAGTTCATT TGAGATACTT GAAGAGGAAG GTTACAATGC CACCCAGATC GTGGACATTT ACTCCGCGGT AAAATCCATT GAGAGGATAG CAGATCACGC AAGTAGGATC TCCGGTTTGA CACTAGAAGT TGGTCCACAA ACGCCTCAGC CAATACTGGA ATTTGGGAAC AAGGTTCTTG AGGTTTACAA GGAATCCACT AGGGCATTTC TAAACGGCAA GAGGGAGATA GCTAACAAGA TCATCGATCA AGATTACGAG CTAGCCATAG AGCATAAGAA GGTCACGGAG ACAATCTTTA GGTCAAGTGA GGCCATGAAA CCCTCACTCT TACTTATCAC GGACTCCTTC AGGAGGATTA GCAGGTATTC TTTGGACCTT GCTGAGACTA CCATAAACCT GCTGGCAAAA ACTAAGACTA TTGAATCCTA G
|
Protein sequence | MQSRSSRRIQ LTGGSTYIIS LPKSWVRQLS LNPGDEVEVI QDNNFRLLLV PKGIPQDTKQ NRATITCENL RPTFAVREFI AYYMAGFTIV SLICPKMKAE DRAMVKDSVR KRLLGAEVIE EDNSNLTVQF LVNEKDLPIS RAINRAAVIT QNMLKDTLDA LRNNDAEMAK EVQERDDEVD RFYFYVARQL TLSISSFEIL EEEGYNATQI VDIYSAVKSI ERIADHASRI SGLTLEVGPQ TPQPILEFGN KVLEVYKEST RAFLNGKREI ANKIIDQDYE LAIEHKKVTE TIFRSSEAMK PSLLLITDSF RRISRYSLDL AETTINLLAK TKTIES
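
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text might be cleaned, checked for GC content, and translated follows. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (the two tables differ in permitted start codons, which this sketch does not model). Only the first 30 nucleotides of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence shown above.
prefix = "ATGCAGAGCA GATCGTCAAG AAGGATCCAA"
print(translate(prefix))
```

Passing the full gene sequence to `translate` and `gc_content` in the same way gives values that can be compared with the Protein sequence and GC content fields of the record.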