Gene Pars_1633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1633 
Symbol 
ID5055371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1472131 
End bp1473960 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content47% 
IMG OID640469173 
Producthypothetical protein 
Protein accessionYP_001153838 
Protein GI145591836 
COG category[R] General function prediction only 
COG ID[COG1579] Zn-ribbon protein, possibly nucleic acid-binding 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACAAGA GGGATTTTCT AAAGCTCCTC TCAAGCTTTG GCTTGGGAGT AATAACCGCA 
GAGCTGTACG AAAGGCTTTT TCACATCCCC GCTTTGGAAA AGGCCTTTAG GGAAGAGGTG
ACCTACTGGA TTGAGCAGTA TCGCAGAGCT AAGGAGCGAC TAGAGACCGT AAGCCGGAGG
GCAACAGCCT TAGAGGACGA AGTGCGGAGG GCGCGGGAGG AGGTGCAGAA GGTGGGCAGA
GAGGTCGCTT CACTTGAGAC CTTGCTTAGG GAGAAGGATG ATGAGGTAGC GGCGTTGAGG
CAGGCACTGG CGTATAGGGA TCAGCTGGAG GAGGAGGCTC TTAGAGCGGT TTATCAAGAA
AAGCTGGAGG AGGCTATTAG CGGGCTGAGG AGAACGGTTG AGAAATACAG GGCGTTGTTG
GGCGAGGATA AAGTGGCTTT TGAATCCGCC GTGGTTAAGA TACTCGAGGA GTACAAAATA
ACGCAGGAGA AGCTGGCTAG GCTAGAGGGC ATGTTCCCGC TAATCTTCCT CAGCTGGACG
CCTGCGAGGG TTGTGCTGGA CAAGATCTAC GACGTGCGGG TAGAAGCAGA GATCGTAAAC
CCGCTTACGC CAGTAACTGA GGTGGAGATA AGCCTCGTCC CAGTGGAGTA CAGATACATG
ATACAGCGAT ATGGAATGAC GGAGGAGGAC TACCACAAGG TGTTTCCGAG AGAGGAGGTA
AAAACAGTTA AGTTCAGGGC TAGAGGCTTA ATTAGAGAGG TCTTCTCCAC TGTCTTCGAA
AACCTAGTAG GCGGGAGGGA GTACGTAATC AAAGTCGTCG TGAGAGATCT ACTAAACAGG
ACAAAAAGCG TAGAGGCGAA AACCCCTTAT ATAAGACAAT ACGAGAACTT CGCCGCATCC
AGTCGTATGA ATGTCGGCAC ATATTATTAC CCTTGGTATG ATCCCGCGGG AACGTGGTTG
AGATATACCT TGGAGACCCC CCTCCTTGGC CAGTACAGCT CAAGGGATCC AGTGGTTATC
AGTAAGCATA TTGACTGGGC AAGTGGCCAC GGCATTAATT TCCTCGTCGT CAGCTGGTGG
GGACCTGACT CCTTTCCAGA TATTGTTTTA AGAAATTATA TTTTAACGAA TTCATTAATC
AAAGATATAA AAATAGTAAT TTTCTATGAG ACACTTGGAA GGCTAAAAGT TAAAGAAGCT
GATCAGAAAA TAGAGCTCGA TGACGAGAAT AAGAAAACTC TCTTAAGCGA TTTCGCATAT
TTAGCAAGGT ATTTCGCCCA CCCCTCTTAC TTAAGGATTG ATGGAAAGTG CATCGTGGTG
ATATATTTAG CTAGGATTTT TGAGGGAGAT GTTAAGGGCA CTCTTGCTGA GATGAGGAGT
AGTATGCAGA GGGTGGGATG CCCCATCTTC ATAATCGGAG ACGTCGTATA TTGGCACAGC
CCCGATAGAA AAATGATTAA GCTCTACGAT GCTGTTACAG CCTATAGCAT GTATACAAAC
ATTCCACAAG TGTTGAGCGA TTTTGAGGAC AAAGTGTCTT GGAAATACGG CGAGTGGTCT
GAGGCAACTA ACGCTCTTGG AGTTGGCTTT ATCCCATCCG CGATGCCCGG ATTTGACGAC
CGGGCAATAA GGACGGGACA TATTCCGCTT CCTAAAAGCA CAGAGAGATT TAGAAAACAA
CTCATTATTG CAAGACAATA CACCAACATT AATACAATTC TTATCACTAC ATTTAACGAG
TGGCACGAAA ATACCAATAT TGAGCCGAGT GTAAAAGACG GCTTTTCATA TTTACAGGTT
CTGAAACAAG TGTTACTTGA AGGGACGTAA
 
Protein sequence
MNKRDFLKLL SSFGLGVITA ELYERLFHIP ALEKAFREEV TYWIEQYRRA KERLETVSRR 
ATALEDEVRR AREEVQKVGR EVASLETLLR EKDDEVAALR QALAYRDQLE EEALRAVYQE
KLEEAISGLR RTVEKYRALL GEDKVAFESA VVKILEEYKI TQEKLARLEG MFPLIFLSWT
PARVVLDKIY DVRVEAEIVN PLTPVTEVEI SLVPVEYRYM IQRYGMTEED YHKVFPREEV
KTVKFRARGL IREVFSTVFE NLVGGREYVI KVVVRDLLNR TKSVEAKTPY IRQYENFAAS
SRMNVGTYYY PWYDPAGTWL RYTLETPLLG QYSSRDPVVI SKHIDWASGH GINFLVVSWW
GPDSFPDIVL RNYILTNSLI KDIKIVIFYE TLGRLKVKEA DQKIELDDEN KKTLLSDFAY
LARYFAHPSY LRIDGKCIVV IYLARIFEGD VKGTLAEMRS SMQRVGCPIF IIGDVVYWHS
PDRKMIKLYD AVTAYSMYTN IPQVLSDFED KVSWKYGEWS EATNALGVGF IPSAMPGFDD
RAIRTGHIPL PKSTERFRKQ LIIARQYTNI NTILITTFNE WHENTNIEPS VKDGFSYLQV
LKQVLLEGT