Gene Pars_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1111 
Symbol 
ID5055485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp997811 
End bp998860 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content65% 
IMG OID640468667 
Producthypothetical protein 
Protein accessionYP_001153341 
Protein GI145591339 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000841175 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGGGGGT TGGGGGGCCG GCGGGGTTTG TGGGGGTTTG TTTTTATTTG GGGTGTTGTG 
TGTTTTGTGG ATGAGTTGTT TGTGTTGGCT GGGTTGCTTG GGGTGTTGGG TGAGAGTGCG
GGGGAGCCGT CTGTGGAGCT TGTGGGGGCG CGGGGGGAGG GGGGTGTGTC TTCTTGTGGG
TACTACGTGG TGAGGCGTGG GTTTGGGGTG CCGCCGGGGG ATGTGCATGG GTTGGATAGC
CACACGTCGG TAGTGGAGTT TGAGGGGGTG TCTGTTATTG TGGCTACTGG GGCGTTGGTG
GGGCGGGTTT TGGCCACGGT ACCGGGGGTG GCGGCGAGGT GGCTGGGGGT TAGGCTGAAT
TTTAGGGGTG GGGTTGAGTT GCTGGAGGGT TCTGGGCTGT ATTACAGGTC GGAGGTGCTT
GGGGTGCCTT TCGATGCTGG GTTTGACTTG GAGGCGGCGA GGGATGAGGT GAGGTACCAC
GTGGAGGGGG TGTTGGCGGG GGTGTGGGAC GGGGGTGGGG TGTTGCTGGT GGATGGGCCG
GTGTTTAGGG TTCCCGACGT GTACCAGAGT GGTGGGGGGT TTTTCCGGCT GTATCTGGAG
TTGGCTAGGG CGAGGGCGGC GTTGCTGAGG GGGGCGGTGG GGGTTGTGAA GAGGGTGGAG
CGGTCTCGGT ACTTGGCGAG GTGTGCCGGG GTTGGGTCGG ACGACGAGGT GGCGGCGCGG
CGGCTTTTGA ACAACTCGCC GGGGTACGTG GGGCCGGTGG TGGTGGAGTG GGAGGGGCTT
CGTAAGTACT TGTTCTACGT GGCTGTGCCT GCGCCGAGGG GGGTTCGGGT GTTTCGGGTG
GAGGCGCTTG AGGAGGGGCT TGCGGAGGAG GCGGCGTCGT GGCTGGGGTC GCTTTCAGAC
GCCTCTGGGT TGCCTCTGCC GCTGGCTGTG GCGGATAGGG TGGCGCGGCG GCTGAACGCG
GCGGCTGTGA AGCTCCTCTA CGCCGCTTCG CCGGTGGAGC CGACGTACCG GGGGCTTGAG
GTGGTGCAGG CGGCGCTGGG GGAGCTGTGA
 
Protein sequence
MWGLGGRRGL WGFVFIWGVV CFVDELFVLA GLLGVLGESA GEPSVELVGA RGEGGVSSCG 
YYVVRRGFGV PPGDVHGLDS HTSVVEFEGV SVIVATGALV GRVLATVPGV AARWLGVRLN
FRGGVELLEG SGLYYRSEVL GVPFDAGFDL EAARDEVRYH VEGVLAGVWD GGGVLLVDGP
VFRVPDVYQS GGGFFRLYLE LARARAALLR GAVGVVKRVE RSRYLARCAG VGSDDEVAAR
RLLNNSPGYV GPVVVEWEGL RKYLFYVAVP APRGVRVFRV EALEEGLAEE AASWLGSLSD
ASGLPLPLAV ADRVARRLNA AAVKLLYAAS PVEPTYRGLE VVQAALGEL