Gene Msed_0228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0228 
SymbolpyrG 
ID5104094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp188583 
End bp190178 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content48% 
IMG OID640506133 
ProductCTP synthetase 
Protein accessionYP_001190329 
Protein GI146303013 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0116565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAAAAT ACATCGTAGT GACCGGCGGA GTTCTTTCCA GTGTTGGCAA GGGTACTGTT 
TCAGCCTCGT TAGGGCTCAT CCTAAAAAAC ATGGGTTATA ACGTGAGTAT AATCAAGGTG
GATCCCTACA TTAACGTTGA TGCAGGGACA ATGAACCCCT ATATGCATGG TGAGGTCTTC
GTTACAGAGG ACGGTGCTGA GACTGACCTG GATCTGGGTC ACTACGAGAG ATTCCTTAAC
ATTAACACAA GCAAGCACAA CAATATAACC GCTGGGAAGG TCTATTTCGA GGTCATAAGA
AAGGAAAGAG AAGGTAAGTA CATGGGCCAG ACGGTCCAGA TTATCCCTCA CGTGACAGAC
GAAATCAAGG CTATGGTCAG GAAAGTCGGT GAAGTGGAGA AGGCTGACAT AGTGATAGTT
GAGGTAGGAG GGACCGTTGG GGACATCGAG GGACTGCCGT TCCTTGAGGC CATGAGGGAA
CTCAGACTAG AGGAGGAAGA GCATAACGTA ATATTCGTCC ATGTAGCCCT TGTGGAGTAC
CTTTCTGTTA CTGGGGAACT AAAGACAAAG CCTCTACAGC ACAGCGTTCA AGAGCTCAGG
AGGATAGGGA TACAGCCAGA TATAGTTATT GCTAGATCCA TAATAGAGCT TGACGAGGAT
ACCAAGAGGA AGATCGCGCT TTTCACCAAT GTAAGGCCTG AGTACATATT CTCGAGCTAT
GACGTGGAAA CGGCATACGA GGTACCGCTC ATTCTCCAGA GACAGGGACT AGGGGCAAGG
GTCACATCTA AGCTTGGACT CCCACAAAAA ACTCCTGATT TTGGAGAGTG GGAGAAATTT
GTGTACTCGG TAAAGAGGAA AGAAGGTAAA AGGGTAAAGA TAGCCCTTGT GGGAAAATAC
ACAAAGCTCA AGGATAGTTA CCTTAGTATA AAGGAGGCAA TATATCACGC CTCTGCTCAC
CTTGGCGTGA TTCCTGAACT ACTTTGGATC GAGTCGTCGG ACCTGGAGAG GGAGAACCCA
GAGGCAATAC TGAAACAGGC AGAAGGTATC ATAGTATTGC CTGGATTCGG CTCCAGGGGT
ACAGAGGGAA AGATCAAGGC AATTAACTAC GCTAGGGTTA ATAACGTTCC CTTCCTAGGA
ATATGCTTCG GGTTACAACT GGCAGTGGTT GAGTTTGCCA GGAACGTTGT GGGTCTTCAG
GGTGCACATA GCACGGAAAT AGACCCTAAC GCTCCCCATC CAGTGGTGAC CCTGTTAGAT
GAGCAGAAAA AGGTTACGCA ATTTGGCGGA ACAATGAGGT TGGGAGCCCA GAGGATAAGC
CTAGTTCGAG GAACCCTGGC CCACTCAATT TACGGGAAGG ACGTAATCTA CGAAAGGCAT
AGGCACAGGT ATGAGGTGAA CCCCTCCTAC GTGGATCTAC TTCAGAAGCA CGGGTTAACA
ATCTCAGGAA TTAGTGACAA TGGTCTTGTG GAGATGATAG AGCTTAAGGA TCACAGATTC
TTCATAGCTA CCCAGGCCCA CCCCGAGTTC AAGAGTAGGC CCTTAAATCC AGCTCCCCTA
TTCCTTGGTT TCCTCAGGGC CGTCGTCGGG AACTAG
 
Protein sequence
MTKYIVVTGG VLSSVGKGTV SASLGLILKN MGYNVSIIKV DPYINVDAGT MNPYMHGEVF 
VTEDGAETDL DLGHYERFLN INTSKHNNIT AGKVYFEVIR KEREGKYMGQ TVQIIPHVTD
EIKAMVRKVG EVEKADIVIV EVGGTVGDIE GLPFLEAMRE LRLEEEEHNV IFVHVALVEY
LSVTGELKTK PLQHSVQELR RIGIQPDIVI ARSIIELDED TKRKIALFTN VRPEYIFSSY
DVETAYEVPL ILQRQGLGAR VTSKLGLPQK TPDFGEWEKF VYSVKRKEGK RVKIALVGKY
TKLKDSYLSI KEAIYHASAH LGVIPELLWI ESSDLERENP EAILKQAEGI IVLPGFGSRG
TEGKIKAINY ARVNNVPFLG ICFGLQLAVV EFARNVVGLQ GAHSTEIDPN APHPVVTLLD
EQKKVTQFGG TMRLGAQRIS LVRGTLAHSI YGKDVIYERH RHRYEVNPSY VDLLQKHGLT
ISGISDNGLV EMIELKDHRF FIATQAHPEF KSRPLNPAPL FLGFLRAVVG N