Gene Pars_0048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0048 
SymbolglyA 
ID5056210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp36523 
End bp37812 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content56% 
IMG OID640467628 
Productserine hydroxymethyltransferase 
Protein accessionYP_001152317 
Protein GI145590315 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCCCA AAGAGCTGGA AGAGATAATC GACGTAGTAC TGGTGCACAA CAACTGGCGG 
CGTAGGGAGA CCATAAACCT CATTGCAAGC GAAAACGTCA TGTCGCCTCT TGCCGAGTTG
GTATACCTAA ACGACATGGC GGGGAGATAC GCGGAGGGGA CCGTGGGCAA CCGGTACTAC
CAAGGCACGA AGTATGTAGA CCTGATAGAA GACGTGCTTA CGAAACGATT TGCCAAAGCC
TTGGGGGCCA CCTACGTGGA CGTCCGGCCT GTCTCGGGCA CTGTGGCGAA CCTGGCGACT
TACTTCGCGC TAGTCCCCGA GGGCGGCGTG GTGGCGTCGT TGCCTGTGAA ATACGGCGGC
CACATCAGCC ACAACACCGT GGGGGGACTA AAGGCCTTGA GGCTGAAAAT GGTCGAACTA
CCCTGGGATC TCGACCGCTT TAATATAGAC GTTGACCGCG CCCGCAAGGT CATAGAGGAG
GCCAAGCCCA ATTTGGTAAT ACTGGGGGGC TCCCTCTACT TGTTCCCCCA CCCGATTAGA
GAAATTGCAG AAATAGCCAA GGCCAGCGGC GCCTATGTGC TCCACGACTC AGCCCACGTC
TTTGGCCTAA TCATCGGCGG CGTCTTTCCA AACCCGCTAA AGGAGGGGGC CCACGTAATT
ACTACGTCTA CACACAAGAC TTTCCCAGGG CCGCAGGGCG GCCTCATAGC CGCCGTGGTT
GAGGATAAGG TAAACGATCT CCAGAGAGCA GTCTTCCCGG TATTCACGTC AAATTACCAC
CTCCACCGCT ACGCCGCCAC CTACGTGACC CTAGTCGAGA TGGAGCACTT CGGCGCTGAG
TACGCCAGAA GGGTGGTGGA GAACGCGAGG GCTTTGGCCG AGGCGTTGGC GGAGCAGGGC
GTGCCGCCCG TGGCAGAGGC GCTTGGCTAT ACAAGGACGC ACCAAGTTGC TGTCGACGTA
TCCAAATTCG GAGGTGGGGA CAAAGTGGCA GCTAAGCTGG AGGAAGCGAA CATAATCGTT
AACAAAAATG CCCTCCCTTG GGACAAAAGC GTTTTGAAGC CAAGCGGCAT AAGGCTGGGG
GTACAGGAGA TGACCCGCTT CGGCATGGGC AAAGACGAGA TGCGGGAAAT CGCAAAGTTC
ATTGCAAGGG TGCTGAGCGG CGAGGACCCC GCCGGCGTCA GGCGCGACGT TGCGGAGTTC
AGAAAGGCGT ATCTGGAGAT CAAATACGGG TTTAAAATTG ACCGGGAATT AGTGGATAAG
GTATTCAAAT CGCTTAGTCT ATATACATAG
 
Protein sequence
MLPKELEEII DVVLVHNNWR RRETINLIAS ENVMSPLAEL VYLNDMAGRY AEGTVGNRYY 
QGTKYVDLIE DVLTKRFAKA LGATYVDVRP VSGTVANLAT YFALVPEGGV VASLPVKYGG
HISHNTVGGL KALRLKMVEL PWDLDRFNID VDRARKVIEE AKPNLVILGG SLYLFPHPIR
EIAEIAKASG AYVLHDSAHV FGLIIGGVFP NPLKEGAHVI TTSTHKTFPG PQGGLIAAVV
EDKVNDLQRA VFPVFTSNYH LHRYAATYVT LVEMEHFGAE YARRVVENAR ALAEALAEQG
VPPVAEALGY TRTHQVAVDV SKFGGGDKVA AKLEEANIIV NKNALPWDKS VLKPSGIRLG
VQEMTRFGMG KDEMREIAKF IARVLSGEDP AGVRRDVAEF RKAYLEIKYG FKIDRELVDK
VFKSLSLYT