Gene Pars_1419 details


Gene Information

Locus tag: Pars_1419
Symbol: trpD
ID: 5054451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1280094
End bp: 1281095
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 62%
IMG OID: 640468960
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_001153629
Protein GI: 145591627
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
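
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(1280094, 1281095))  # 1002
print(protein_length(1002))           # 333
```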


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.195691
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATCTAA AAGCCCTATT GAGAAAGCTC GGCAACGGCC TTAGGCTTAC GGCCGACGAG 
GCTTACCTCC TCGGGAGGGG GATCCTCTCA GGTTCCCTTT GCGACGTCGA GGTGGCGGCT
TCCCTCACTG CTATGAGGGT GCGGGGTGAA TCGCCGGAGG AGGTGGCTGG GTTTGTGAAG
ATGGCTAGGG AGTTCGCGGT TAGGGTGCCC CTCCGCATAG AGGCGGTGGA CACGGCGGGG
ACTGGCGGGG ACGGCGCCGG TACCATAAAC CTCTCGACGG CAGCGGCCAT CGTCGCCGCG
GCGGCTGGGG CTAAGGTGTT GAAGCACGGG AATAGGTCAG CCTCAGGCCT CTTCGGCAGC
GCCGACTTTA TGGAGGCGGT TGGCTACAAC TTAGAAGTAG GGCCTGAGAA GGCGGCTGAG
CTTGTGGAAA AGGTGGGCTT CGCCTTCGTC TTCGCGCCTA GATACCACCC GGCCTTCGCC
AAAGTGGCGC CTGTGCGCCG CGCCCTGCCG TTCCGAACTA TTTTCAACAT CGTCGGCCCC
TTGGCCAACC CGGGGCTGGT GAAGAGGCAA CTCATCGGCG TTGCTGAAGA GAGGCTTCTC
GAAGTTGTGG CGGCCGCCGC GGCTGAGCTG GGTTTTGAAC ACGCAGTGGT TGTCCACGGC
TCTGGAGTAG ACGAGGTGTC CAGTGAGGGG GCGACGACGG TGTACGAGGT GAAGAGGGGG
TCGTTGGAGA GGTACCAAAT AGCGCCGGAG GATTTAGGCG CCCCCCGCGT CCCCATACCG
CGTGCCTCTG ACAAAGAAGA GGCGGTGGCT AAGGCCTTGG CTGGGCTTCG GGGAGAGCTG
AGGGAGGCCT CCGTGGCCAT TGCCCTAAAC GCCGCCTTTG CCCTCTACGT AGCAGGAGTG
GTTGGAGATC CGAGAGATGG CTTTGAGCTT GCCATGAGGG CGATACAAGA GGGGGTAGCG
TATCGGAAGC TGGTAGAGGC GGTGGAGGCA TCTCGGACAT GA
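
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be joined before any calculation. The sketch below, in Python, shows one way to strip that layout and compute a GC percentage comparable to the GC content field of this record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First line of the gene sequence, used as a short demonstration input.
first_line = "GTGGATCTAA AAGCCCTATT GAGAAAGCTC GGCAACGGCC TTAGGCTTAC GGCCGACGAG"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(clean_sequence(first_line)), 1))
```

Passing the full 1002 bp sequence through the same two functions gives the value for the whole gene.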
 
Protein sequence
MDLKALLRKL GNGLRLTADE AYLLGRGILS GSLCDVEVAA SLTAMRVRGE SPEEVAGFVK 
MAREFAVRVP LRIEAVDTAG TGGDGAGTIN LSTAAAIVAA AAGAKVLKHG NRSASGLFGS
ADFMEAVGYN LEVGPEKAAE LVEKVGFAFV FAPRYHPAFA KVAPVRRALP FRTIFNIVGP
LANPGLVKRQ LIGVAEERLL EVVAAAAAEL GFEHAVVVHG SGVDEVSSEG ATTVYEVKRG
SLERYQIAPE DLGAPRVPIP RASDKEEAVA KALAGLRGEL REASVAIALN AAFALYVAGV
VGDPRDGFEL AMRAIQEGVA YRKLVEAVEA SRT
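
The protein sequence begins with M although the gene sequence begins with GTG rather than ATG. Under translation table 11, GTG can act as an initiation codon, and an initiation codon is read as methionine. A minimal Python sketch of that translation follows. The `translate` function is illustrative, not the tool used to produce this record, and it assumes the input is a complete CDS starting at its initiation codon.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    if protein:
        # Assumption: the first codon is the initiation codon, read as Met.
        protein[0] = "M"
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGGATCTAA AAGCCCTATT GAGAAAGCTC"))  # MDLKALLRKL
```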