Gene Pars_1436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1436 
Symbol 
ID5054195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1293927 
End bp1295357 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content57% 
IMG OID640468977 
Productradical SAM domain-containing protein 
Protein accessionYP_001153646 
Protein GI145591644 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.37868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00181566 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGCGTACA GAAAGAACGC GGTGAAAATC GCCCTGTTAT ACCCCTCCAC CTATTCAGTT 
GCCATGTCGT CTTCTATTTA CCACGTCTTG TATTTCAAGC TACAAGACGC CGGTTTCTAC
GTAGAGAGGT TCACCGCCGA CCGCGGGCCC CGGGGCGTGG AAGACGGCAC TCCGCTTACC
CACTTTGACC ACATCCTCGC CACTGTGCAC TACGAGCTGG ATTACATCAA CCTAGTGAAG
ATGCTCATAG ACGCAGGCAT CCCGCCGGAG GCTGGTAGAA GGAAAAAGCC CAAGCTGATA
ATCGGCGGCC CCCCCGTGAC GGCGAATCCA GAGCCGCTTG CAGAGTTTGC AGACGCCATG
GCGCTGGGAG AACTGGAAGC CCTCTGGGAG CCGCTTCTCG CCTATCTCTC CACACGTGAG
GAGGCCGAGG GACTTTACTA CCCCGCGCGG GGGTCGCATC CGGTGTCGAT TGCCTATGCG
CCGGACGTCC GCGAAGTGGA CTACAGGAGG CTACCCGAGC CTGAGTCGGC CTTCAGTATT
TCGATCGAGG CGGCGAGGGG TTGTCCCTTC TCCTGTTTAT TCTGCATGGA GAGCTACATA
ACTAAGCCAT ACCGCCCCAG AGACTGGATA ACCGTCGTGA ACGAGGCGGA GAGGCTATAC
AAGAAGTCCG GCGTTAGGCC GTCGCTTGTG GCACTCACCG CGAACTCACA TCCACATTTC
AAGGAGATAC TCCGCGCGGC GGTTGAGAGG GGGTTGCCGC TATCTCTCCC CTCTCTCAGA
GCTGAGTTGC TAGACGACGA GGCTCTGGAG CTCATAGCTA GACTAGGGCA GAGAACCTTG
ACAATCGCCC CGGAGACCAG CGAGAGGCTG AGGAAGGCGC TTGGCAAAAA CTTCACAAAC
CAAGACGTCA TAAGAGTGGC AAAGAAGGCG TCGCAGCTGG GGCTTAAGCT CAAGCTCTAC
CTAATGGTAG GGTTGCCGTG CGAAAAGGAG GACGACCTCA AAGAGGTGGT GGAGCTCGCT
AAACAAGTTA AGCGGGTCGG GGCCTACCTA TATCTCAGCG TAAATCCTTT TGTCCCAAAA
CCACAGACGC CCCTCCAGTA CCATCCCATG GCCCCTCTTG GCTACTTAAG AAAAAGCCTC
AGCGAAATCA GGAAAGCGCC TCACGACGAG TACTCGCAAT ACGACACAAC CCTAGCGGCA
ATCCAGGCAG CGATCTCGCT AGGGGGCCGC GAGGTGTCAC GCCATATAGA GGCGTCCGCA
AATAACCCCA GTCCCTTGGG TTATTGGAAG AGCCTATTAA GAAGAGGAGA GCTGGACTAC
GTCTTCAAGC CGCGGGAAGA CCCCCTTCCC TGGGAGCACG TGCGGGGCTT CTATCAGCCC
GGGGAGCTTA GAAAGAGGTA CGAGAAATTC CTAGAAGAGG CTTGTGCCTA G
 
Protein sequence
MAYRKNAVKI ALLYPSTYSV AMSSSIYHVL YFKLQDAGFY VERFTADRGP RGVEDGTPLT 
HFDHILATVH YELDYINLVK MLIDAGIPPE AGRRKKPKLI IGGPPVTANP EPLAEFADAM
ALGELEALWE PLLAYLSTRE EAEGLYYPAR GSHPVSIAYA PDVREVDYRR LPEPESAFSI
SIEAARGCPF SCLFCMESYI TKPYRPRDWI TVVNEAERLY KKSGVRPSLV ALTANSHPHF
KEILRAAVER GLPLSLPSLR AELLDDEALE LIARLGQRTL TIAPETSERL RKALGKNFTN
QDVIRVAKKA SQLGLKLKLY LMVGLPCEKE DDLKEVVELA KQVKRVGAYL YLSVNPFVPK
PQTPLQYHPM APLGYLRKSL SEIRKAPHDE YSQYDTTLAA IQAAISLGGR EVSRHIEASA
NNPSPLGYWK SLLRRGELDY VFKPREDPLP WEHVRGFYQP GELRKRYEKF LEEACA