Gene Msed_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1968 
Symbol 
ID5103355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1905118 
End bp1906128 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content46% 
IMG OID640507856 
Productphosphate uptake regulator, PhoU 
Protein accessionYP_001192032 
Protein GI146304716 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0704] Phosphate uptake regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAGCA GATCGTCAAG AAGGATCCAA TTAACAGGGG GATCCACTTA TATCATCTCC 
TTACCTAAGT CCTGGGTAAG ACAGCTATCT TTAAACCCAG GGGATGAAGT TGAGGTAATT
CAGGACAACA ACTTTAGGCT TCTCCTAGTC CCTAAGGGGA TTCCACAGGA CACCAAGCAG
AACAGGGCAA CCATTACATG TGAAAATTTG AGGCCAACCT TCGCAGTTAG GGAGTTCATC
GCGTATTACA TGGCTGGTTT CACAATAGTC TCACTGATAT GCCCCAAGAT GAAGGCTGAG
GATAGGGCCA TGGTAAAGGA CAGCGTAAGG AAAAGATTGC TTGGGGCTGA GGTTATAGAG
GAGGACAACT CAAACCTGAC TGTTCAGTTC CTGGTTAACG AAAAGGATCT GCCAATCTCG
AGGGCCATAA ACAGGGCAGC CGTGATCACC CAGAACATGT TAAAGGATAC TCTTGACGCC
CTGAGGAATA ACGACGCGGA GATGGCCAAG GAGGTCCAGG AGAGAGACGA CGAGGTGGAT
AGGTTCTACT TTTACGTAGC TAGACAACTC ACTCTAAGCA TAAGTTCATT TGAGATACTT
GAAGAGGAAG GTTACAATGC CACCCAGATC GTGGACATTT ACTCCGCGGT AAAATCCATT
GAGAGGATAG CAGATCACGC AAGTAGGATC TCCGGTTTGA CACTAGAAGT TGGTCCACAA
ACGCCTCAGC CAATACTGGA ATTTGGGAAC AAGGTTCTTG AGGTTTACAA GGAATCCACT
AGGGCATTTC TAAACGGCAA GAGGGAGATA GCTAACAAGA TCATCGATCA AGATTACGAG
CTAGCCATAG AGCATAAGAA GGTCACGGAG ACAATCTTTA GGTCAAGTGA GGCCATGAAA
CCCTCACTCT TACTTATCAC GGACTCCTTC AGGAGGATTA GCAGGTATTC TTTGGACCTT
GCTGAGACTA CCATAAACCT GCTGGCAAAA ACTAAGACTA TTGAATCCTA G
 
Protein sequence
MQSRSSRRIQ LTGGSTYIIS LPKSWVRQLS LNPGDEVEVI QDNNFRLLLV PKGIPQDTKQ 
NRATITCENL RPTFAVREFI AYYMAGFTIV SLICPKMKAE DRAMVKDSVR KRLLGAEVIE
EDNSNLTVQF LVNEKDLPIS RAINRAAVIT QNMLKDTLDA LRNNDAEMAK EVQERDDEVD
RFYFYVARQL TLSISSFEIL EEEGYNATQI VDIYSAVKSI ERIADHASRI SGLTLEVGPQ
TPQPILEFGN KVLEVYKEST RAFLNGKREI ANKIIDQDYE LAIEHKKVTE TIFRSSEAMK
PSLLLITDSF RRISRYSLDL AETTINLLAK TKTIES