Gene Pars_2189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2189 
Symbol 
ID5054398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1960679 
End bp1962112 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content64% 
IMG OID640469741 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_001154387 
Protein GI145592385 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAGGG TGCTTGTAAT TGGCGATGGG GCTCGGGAAC ACGCATTGGC GTGGGGGCTT 
GCGAGGAGCG GGGTTAGGCT CTACGCCTTG ATGGGGCACC TCAACCCGGG CGTTGCCCAG
CTGGTGAGGG AGAGCGGGGG GTCGTACCGG CTTGGCTCCC CGACTAGCGC AGCGGAGGCG
GTTAAGGCGG CTGAGGAGTT CTCCCCAGAC CTAGTGGTGG TTGGGCCGGA GGAGCCTCTC
TTCGCTGGGG TCTCCGACGC GCTTAGGGAG AGAGGCTTCA TAACTCTAGG CGCGTCTTCT
GGGGTGGCCA TTATTGAACA GAGGAAGGAC GTGGCGAGGG GCCTTCAGTG GAAGTACGGA
GTCCCCGGGC GGCTGGTATA CGGCGTATTC GCAGACGTCG CTGAGGCCTA CTCCTTTGCC
AAGGCCCTCG GCTCGGTGGC CATCAAGCCG GTTAGGCAGG CAGGTGGGAA GGGTGTGAGG
GTGGTCTACG GGGAGGCCAA GTACCTAGAC AGCACGCTTG ACGAGGTCGT CGCCAGGGGG
GCGCAGGAGG CAAAGGCCCA GCTGGCCTCG TACGGGGATG TGCCCCAGGC AGTGCTCGTG
GAGGAGGCGG TGTGGGGGGT GGAGTACACG GTGCAGGCTC TTGTAGACGG CGAATCGGTC
TTCGCGTTTC TCCCCGTACA GGACAACCCG CATGCCTACG AGCTTGGCCT TGGCCCGGAG
TGCGGGGGCA TGGGCACCGT CTCTCCCCTG CCGTTTATAG AAGAGGGGGA ATTCCACGCG
GCTGTTGAGG CGATTAAGGC GACGGCTGAG GCCGTGAGGC GCGAGTTCGG CGTGGAGTAC
GTGGGCGTCT TAAGTGGGCA GATGATGCTC ACGGCAATGG GGCCTGTGGT TATTGAGTAC
TACAGCAGGT TCGGCGATCC TGAGGCCCTA AACGCCGTCT ACCTCTACGA CGGCGATCTC
TACGACTTGT TCCTAAAAGC GGCGACTAAA AAGCTACACA AGGCTCAGCG CAGGTTCAAG
GCGGAGTACA CCGTGGTGAA GGCAATAGCC CCCCTGGGCT ACCCCCTCGA CAGGAGGCTG
GCCGCGGGTA GGGTTTTCCA CGTGGATTGG GACGCGGTGA GGCGTGCCGG CTGCCTAGTC
TTCTTCGGCT CGGCAGAGCC TGCTGAGGGC GGCGGGTACA AGACGCTGGG CTCCCGCGCC
GTTGAGATAC TCGGCGCTGG GGCGACGCCA GAGGAGGCCT ACGAAAGGGC TGAGAGGTGC
GCCGCCGCCG TCAAGGGGGA GGGCCTTTTC TACCGGAGCG ACATTGGCTC GCCGGAGTAC
ATGGCGGCGA TGAAACGCAA AGCGGAACAG GTAAGAGCTG TCTACAAGTG GCGCGGCGAG
CGGGGGGAGC GCTTGGTGTG GGAGCCGGGC AAGGGGCTGA TCCGCTTCGG GTGA
 
Protein sequence
MDRVLVIGDG AREHALAWGL ARSGVRLYAL MGHLNPGVAQ LVRESGGSYR LGSPTSAAEA 
VKAAEEFSPD LVVVGPEEPL FAGVSDALRE RGFITLGASS GVAIIEQRKD VARGLQWKYG
VPGRLVYGVF ADVAEAYSFA KALGSVAIKP VRQAGGKGVR VVYGEAKYLD STLDEVVARG
AQEAKAQLAS YGDVPQAVLV EEAVWGVEYT VQALVDGESV FAFLPVQDNP HAYELGLGPE
CGGMGTVSPL PFIEEGEFHA AVEAIKATAE AVRREFGVEY VGVLSGQMML TAMGPVVIEY
YSRFGDPEAL NAVYLYDGDL YDLFLKAATK KLHKAQRRFK AEYTVVKAIA PLGYPLDRRL
AAGRVFHVDW DAVRRAGCLV FFGSAEPAEG GGYKTLGSRA VEILGAGATP EEAYERAERC
AAAVKGEGLF YRSDIGSPEY MAAMKRKAEQ VRAVYKWRGE RGERLVWEPG KGLIRFG