Gene Pars_2254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2254 
Symbol 
ID5054286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2019890 
End bp2021161 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content60% 
IMG OID640469807 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_001154452 
Protein GI145592450 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.100829 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0981223 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTTTG AAAGGGCTAG GCAAGTCTTC CCCGGCGGGG TTAATTCCCC TGCCAGGGCT 
CTCAAACACC TCCCGTCGTC GCTCGTCGCA AGGGCCGCCT CTGGGCCCTA CCTATACACC
GACCGCGGGA GGCTTGTGGA CTACTGCATG GCGTTTGGCG CCATAATCCT CGGCCACGCC
CACCCCCGGG TGAAAAGGGC CGTGGAGGAG CAGCTGGAGA GGGGCTGGAT ATACGCCCTG
CTCACCGAGC AGGAGGTGGA ATTCGCCGAG GCCATAAGGC GGCACATGCC CTCTGTGGAG
AAGATGCGGA TAGTGAATAC TGGAACCGAG GCCACGATGA ACGCCATAAG GCTCGCCCGG
GGCTACACGA AGCGCGACGT GATAATTAAA TTCGACGGAA ACTTTCACGG CTCCCACGAC
TATGTTTTGG TCAAGGCCGG CTCCGGGGCG GCGACTTGGG GCATACCCAC AAGCGCCGGC
GTGCCGCAAG ACGTAGTCAA GCTGACGGTA GTGGCGCCTT ACAACGACGT AGACGCATTC
CTCAAGGCAG TAAAGGAAGT GGGGGACAGA CTAGCGGCGG TGATTGCGGA GCCGGTGGCG
GGGAACTACG GCCTCATAAT ACCCGACGCG GAGTTTCTTA AGGCGCTGAG GGAGGAGACC
AAACGCGTAG GGGCCCTCCT GATATTTGAC GAAGTAATTA CGGGCTTTAG GCTGGGCCTC
GGCGGTGCCC AAGGCCGCTT CGGCATAAGG CCAGACCTCA CCACCCTGGG CAAGGCCGTG
GGCGGAGGCT TCCCCATCGG TATATTCGGT GGAAGGGCAG AGGTGATGGA CTTGGTCGCG
CCCAGCGGCC CCGTGTACAA CGCAGGCACG TACAACGCCC ATCCTGTCTC GGTGACTGCC
GGCCTCGCCG TGTTGAAAGA GCTGGAAACC GGTGAGCCCT TCCGCACAGC AGACGAGGCG
GCGGAGAGGC TTGCCAAGGG CATAGAGGAC ATCGCCGGGA GGCTCGGCTT TGACGTGGTC
GTGAAGAAGA TAGCCTCCAT GTTCCAGTTC TACTTCAAGA AAGGCGACGT GAAGACCCCC
CAAGACGTCA GGGAGAGCAA CGAGAAAATG TACCTAAAAC TCCACGAGAT CGCGCTTAGA
CACGGCGTCT ACCTAACCCC CTCCCAGTTC GAGGTGAACT TCACATCGGC AGCTCACACC
AGAGAGGTGG TCGAGGAGAC CCTCGCCGCG CTGGAGAAGG CCTTTCAACA ATTAAAGACG
GAAATCGGGT AG
 
Protein sequence
MLFERARQVF PGGVNSPARA LKHLPSSLVA RAASGPYLYT DRGRLVDYCM AFGAIILGHA 
HPRVKRAVEE QLERGWIYAL LTEQEVEFAE AIRRHMPSVE KMRIVNTGTE ATMNAIRLAR
GYTKRDVIIK FDGNFHGSHD YVLVKAGSGA ATWGIPTSAG VPQDVVKLTV VAPYNDVDAF
LKAVKEVGDR LAAVIAEPVA GNYGLIIPDA EFLKALREET KRVGALLIFD EVITGFRLGL
GGAQGRFGIR PDLTTLGKAV GGGFPIGIFG GRAEVMDLVA PSGPVYNAGT YNAHPVSVTA
GLAVLKELET GEPFRTADEA AERLAKGIED IAGRLGFDVV VKKIASMFQF YFKKGDVKTP
QDVRESNEKM YLKLHEIALR HGVYLTPSQF EVNFTSAAHT REVVEETLAA LEKAFQQLKT
EIG