Gene Pars_1418 details


Gene Information

Locus tag: Pars_1418
Symbol:
ID: 5056425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1278824
End bp: 1280053
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 63%
IMG OID: 640468959
Product: tryptophan synthase subunit beta
Protein accession: YP_001153628
Protein GI: 145591626
COG category: [R] General function prediction only
COG ID: [COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB)
TIGRFAM ID: [TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme
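
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1278824, 1280053)
print(length_bp, protein_length(length_bp))  # prints: 1230 409
```

Both results agree with the Gene Length and Protein Length fields of the record.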


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.955959
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.212378
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTAGACA GGTGGTACAA CATCGCCGCT GACCTCCCCG CTGTCTTGGC TCCTCCCAAA 
GACCCAGACG AGGGCGAGAG TAGAATCGGC CTGCTTACGA GAATACTCCC ATCCGCGCTT
ATAGACCAAG AATTTTCTGC AGAGCGCTGG ATTACTGTTC CAGAAGAGGT TAGAGACGCG
TACCGGCGTG TTGGTAGGCC GACGCCCCTC CTCCGCGCCG AGGGGCTGGA GAGGGCTCTG
GGCACGGGGG TGAGGATATA CTACAAGTAC GAGGGGGTGC TCCCAGTGGG TAGCCACAAG
CTCAACACTG CCCTGGCACA GGCCTACTAC GCCAAGGCCG ACGGCGCGGT GGAAGTGGCC
ACCGAGACGG GGGCGGGGCA GTGGGGCATG GCTGTCTCCC TCGCGGCTGC TCTCTTCGGC
CTAAAGGCAG TGGTGTTTAT GACCCGCTCC TCCTACAACT CAAAGAGGCA GAGGCTGACC
TTTATGAGGA CTTACGGCGC GACGGTGTAC CCCAGCCCCA GCGAAGTGAC GGAGGCGGGG
AGGAGGCATT ACCGGCCGGA CCACCCAGGC TCGCTGGGGA TCGCAATATC GGAGGCAGTG
GAGTACGTCC TATCCGGCGA GAAGAGGCAC TACCTTCCGG GCAGCGTCTT GGAGTTCGTG
CTCATGCACC AGACCGTCAT AGGACTAGAG GCGGTTAGGC AACTGCCGGA GGAGCCGGAC
GCCGCCGTGG CCTGCGTTGG CGGGGGGTCG AACTTCGCCG GCTTTACCTA CCCCATGATC
GGGATGAAGC TGAGGGGCGA GGGCTTCGAC AAGACGAGGT TCGTCGCAGT TGAGGCGGAA
GCCGCCCCCA AGCTCACAAA GGGGGAGTAC AAATACGACT TCCCAGACGC CGTGGGGATA
CTCCCCATGA TCAAGATGTA CACCTTAGGC CACGACTACG TCCCGCCGCC CGTCCACGCG
GCCGGCCTCC GGTACCACGG CGCCGCGCCG TCCCTCTCCT TGCTTCGGAA ATTGGGGATA
GTGGAGCCGC TCTCCTACCC CCAGGAGGAG GTCATGAAAG CCGCAGTGCT CTTCGCGAGG
ACGGAGGGCA TTGTACCGGC GCCGGAGTCG GCCCACGCGA TAAGGGCAGT GCTAGACCTC
GCAAAAAAGC TCCCGCGCGG CTCGGTAATA GCGTTCAACC TCTCCGGCCA CGGCCTCCTC
GACTCCGACG CCTACGAGAA GTTCCTGTAA
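
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any calculation. The following is a minimal sketch of recomputing the GC content field from the sequence text; the helper names are hypothetical, and the example input is only the first line of the sequence, so its result is not expected to equal the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence; paste the full sequence to check
# the GC content reported for the whole gene.
first_line = "GTGGTAGACA GGTGGTACAA CATCGCCGCT GACCTCCCCG CTGTCTTGGC TCCTCCCAAA"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```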
 
Protein sequence
MVDRWYNIAA DLPAVLAPPK DPDEGESRIG LLTRILPSAL IDQEFSAERW ITVPEEVRDA 
YRRVGRPTPL LRAEGLERAL GTGVRIYYKY EGVLPVGSHK LNTALAQAYY AKADGAVEVA
TETGAGQWGM AVSLAAALFG LKAVVFMTRS SYNSKRQRLT FMRTYGATVY PSPSEVTEAG
RRHYRPDHPG SLGIAISEAV EYVLSGEKRH YLPGSVLEFV LMHQTVIGLE AVRQLPEEPD
AAVACVGGGS NFAGFTYPMI GMKLRGEGFD KTRFVAVEAE AAPKLTKGEY KYDFPDAVGI
LPMIKMYTLG HDYVPPPVHA AGLRYHGAAP SLSLLRKLGI VEPLSYPQEE VMKAAVLFAR
TEGIVPAPES AHAIRAVLDL AKKLPRGSVI AFNLSGHGLL DSDAYEKFL
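
The protein sequence can be related back to the gene sequence by translation. The following is a minimal sketch that uses no external library. It assumes the amino-acid assignments of NCBI translation table 11 (the same as the standard code), and it treats only ATG, GTG and TTG as start codons, which is a simplification of the initiator set defined for table 11. The function names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Simplifying assumption: only these alternative start codons are recognized.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def translate(seq: str, first_is_start: bool = True) -> str:
    """Translate seq codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and first_is_start and codon in START_CODONS:
            residues.append("M")  # a start codon always encodes Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence (GTG start codon).
print(translate(clean_sequence("GTGGTAGACA GGTGGTACAA CATCGCCGCT")))  # MVDRWYNIAA
```

The output matches the first block of the protein sequence above; the leading GTG is read as methionine because it is the start codon.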