Gene Pars_0779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0779 
Symbol 
ID5054936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp694909 
End bp696510 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content49% 
IMG OID640468338 
Producthypothetical protein 
Protein accessionYP_001153017 
Protein GI145591015 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGAA ACGTTTTGTT GGGAGTTCTA TTATTTGCCG CCGCGATTGT GGTATTGCAA 
ACGTGGGTAG ACGCTGTTGC GCAGAGAACT CCGTGGGGCA AAATAGTAGT GCAAAGCGGA
GACGGGCAAG ACGAGATAGA AGTAGTAATC TTAGAAGGGG GGAGGAGGTC AGACCGCGAC
TACGACTTCG CAATTGTGGA GATGGACGGA GAGATACGCA CAAGACGGGC GGAGAGGGGC
TACGGCAAAG TGGTGCTGGG CGCCTACCAC GCCAAACTAG CTCAAATAGC CAAAGAACTC
GGCTACAACC CAACCGACGT TGAGCTGGGA GTTATTATTG AAATCGGGCG CCTCGTAGAC
CACAACGAGA CACACATAGC GGCAGAGAGA ATGATAATCT CAGCCCCCCT AAAGCCTGGA
AAGCCAAACA AGATAAGGGT AGAAGTAGAA TTCAAGCCAC AAGCCAAAAC ATTGGTGCCC
AAGCCAAAAA ACTCAACAAA GCCACACACG CGGAGCACAT ATGAAGTCCA AGCCTCTACC
TCGCCACCCA GCTTAATTGA ACAGGGCTGT CCCGTGATAA AAGACCCATT TTCCAACCCG
GTATATACAT GTTATGAGTG GAGGCTTGTG TCGAGCACGA AGAGCCAAGA CACGCTAATC
CCAAGCATGA TCGTATACCT AGGCGCCAAC GATGTATACA ACGCCAAATC TGTGTCCGTA
TATTCAGAGA TAAAAGTAGG TCAAGACACA GGCCTGAGGC TGACTCTTCC CATAGGCGCC
GTCTTATCTA AAGAGGGGAA GACATATCTA CTAACACAAG GATTTACAGT AAACCTAGGC
AGTAGTACTA CGTTAATGCA AGCAAATTGT CTATTTAGAT CGAATAAGGA GACCCCCGCT
TATTCAGAGT GTAGAGACAT GCTAAGAGGC ACCTCTACTA CATTGCCAAC TTATGATAGT
AGTAGTCCTA CAGGGGCGTG GGCGACCGTA GGGCCGCGGG GTGTATATTG GCAGGTGACA
TACGACATCT ACTACGTGGC CTATGTGTAC AACTGGACGT TGAGAAAATA CGTAATTACG
TACCAGTATA AAGTCGACAC AGCTACAGGC GTCTGGTCAG CGCCCTACGC AGAGCAGGTG
GGCAACGGAC AGTATAGATT CTGGCCTGAA TTTAGAATCA ACTCATATGA AATAGGCCGG
GTCGTAACTT ACTACAATAC ACCCGAGTGG AGTGGAGGAC CGCCTAAATT CACGATTCAG
AGCGGGGCAA GTTCTTACAT TGCCATAGGT GTATATCAAA TCGTAAGTGA AGCTAATACA
CTGTTTTCAA CTACGCCGTT TGCAGTACCC GCAACCCTTA TTGCTTGTGC TTATACAGGT
GGCTCCTTGT GCACTATTGC CGCCACCGCG GCTACTTCGT TTGACGTTTC TTATGAGACG
TCTATGTATT TAAGCCAGTT GGGAATAGTG CAAGCCGATT TGAGGTGTTC GGTGCCGTAT
GAAAGCACAG GCTTTACTAT AAAGAACTAC CTAGAAGGCA TGGGCGTTGA GGTATATCTC
CCCGCGGTGT TTATACATCT CCGCACCACT GCACGTTGTT AA
 
Protein sequence
MLRNVLLGVL LFAAAIVVLQ TWVDAVAQRT PWGKIVVQSG DGQDEIEVVI LEGGRRSDRD 
YDFAIVEMDG EIRTRRAERG YGKVVLGAYH AKLAQIAKEL GYNPTDVELG VIIEIGRLVD
HNETHIAAER MIISAPLKPG KPNKIRVEVE FKPQAKTLVP KPKNSTKPHT RSTYEVQAST
SPPSLIEQGC PVIKDPFSNP VYTCYEWRLV SSTKSQDTLI PSMIVYLGAN DVYNAKSVSV
YSEIKVGQDT GLRLTLPIGA VLSKEGKTYL LTQGFTVNLG SSTTLMQANC LFRSNKETPA
YSECRDMLRG TSTTLPTYDS SSPTGAWATV GPRGVYWQVT YDIYYVAYVY NWTLRKYVIT
YQYKVDTATG VWSAPYAEQV GNGQYRFWPE FRINSYEIGR VVTYYNTPEW SGGPPKFTIQ
SGASSYIAIG VYQIVSEANT LFSTTPFAVP ATLIACAYTG GSLCTIAATA ATSFDVSYET
SMYLSQLGIV QADLRCSVPY ESTGFTIKNY LEGMGVEVYL PAVFIHLRTT ARC