Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1633 |
Symbol | |
ID | 5055371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1472131 |
End bp | 1473960 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640469173 |
Product | hypothetical protein |
Protein accession | YP_001153838 |
Protein GI | 145591836 |
COG category | [R] General function prediction only |
COG ID | [COG1579] Zn-ribbon protein, possibly nucleic acid-binding |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACAAGA GGGATTTTCT AAAGCTCCTC TCAAGCTTTG GCTTGGGAGT AATAACCGCA GAGCTGTACG AAAGGCTTTT TCACATCCCC GCTTTGGAAA AGGCCTTTAG GGAAGAGGTG ACCTACTGGA TTGAGCAGTA TCGCAGAGCT AAGGAGCGAC TAGAGACCGT AAGCCGGAGG GCAACAGCCT TAGAGGACGA AGTGCGGAGG GCGCGGGAGG AGGTGCAGAA GGTGGGCAGA GAGGTCGCTT CACTTGAGAC CTTGCTTAGG GAGAAGGATG ATGAGGTAGC GGCGTTGAGG CAGGCACTGG CGTATAGGGA TCAGCTGGAG GAGGAGGCTC TTAGAGCGGT TTATCAAGAA AAGCTGGAGG AGGCTATTAG CGGGCTGAGG AGAACGGTTG AGAAATACAG GGCGTTGTTG GGCGAGGATA AAGTGGCTTT TGAATCCGCC GTGGTTAAGA TACTCGAGGA GTACAAAATA ACGCAGGAGA AGCTGGCTAG GCTAGAGGGC ATGTTCCCGC TAATCTTCCT CAGCTGGACG CCTGCGAGGG TTGTGCTGGA CAAGATCTAC GACGTGCGGG TAGAAGCAGA GATCGTAAAC CCGCTTACGC CAGTAACTGA GGTGGAGATA AGCCTCGTCC CAGTGGAGTA CAGATACATG ATACAGCGAT ATGGAATGAC GGAGGAGGAC TACCACAAGG TGTTTCCGAG AGAGGAGGTA AAAACAGTTA AGTTCAGGGC TAGAGGCTTA ATTAGAGAGG TCTTCTCCAC TGTCTTCGAA AACCTAGTAG GCGGGAGGGA GTACGTAATC AAAGTCGTCG TGAGAGATCT ACTAAACAGG ACAAAAAGCG TAGAGGCGAA AACCCCTTAT ATAAGACAAT ACGAGAACTT CGCCGCATCC AGTCGTATGA ATGTCGGCAC ATATTATTAC CCTTGGTATG ATCCCGCGGG AACGTGGTTG AGATATACCT TGGAGACCCC CCTCCTTGGC CAGTACAGCT CAAGGGATCC AGTGGTTATC AGTAAGCATA TTGACTGGGC AAGTGGCCAC GGCATTAATT TCCTCGTCGT CAGCTGGTGG GGACCTGACT CCTTTCCAGA TATTGTTTTA AGAAATTATA TTTTAACGAA TTCATTAATC AAAGATATAA AAATAGTAAT TTTCTATGAG ACACTTGGAA GGCTAAAAGT TAAAGAAGCT GATCAGAAAA TAGAGCTCGA TGACGAGAAT AAGAAAACTC TCTTAAGCGA TTTCGCATAT TTAGCAAGGT ATTTCGCCCA CCCCTCTTAC TTAAGGATTG ATGGAAAGTG CATCGTGGTG ATATATTTAG CTAGGATTTT TGAGGGAGAT GTTAAGGGCA CTCTTGCTGA GATGAGGAGT AGTATGCAGA GGGTGGGATG CCCCATCTTC ATAATCGGAG ACGTCGTATA TTGGCACAGC CCCGATAGAA AAATGATTAA GCTCTACGAT GCTGTTACAG CCTATAGCAT GTATACAAAC ATTCCACAAG TGTTGAGCGA TTTTGAGGAC AAAGTGTCTT GGAAATACGG CGAGTGGTCT GAGGCAACTA ACGCTCTTGG AGTTGGCTTT ATCCCATCCG CGATGCCCGG ATTTGACGAC CGGGCAATAA GGACGGGACA TATTCCGCTT CCTAAAAGCA CAGAGAGATT TAGAAAACAA CTCATTATTG CAAGACAATA CACCAACATT AATACAATTC TTATCACTAC ATTTAACGAG TGGCACGAAA ATACCAATAT TGAGCCGAGT GTAAAAGACG GCTTTTCATA TTTACAGGTT CTGAAACAAG TGTTACTTGA AGGGACGTAA
|
Protein sequence | MNKRDFLKLL SSFGLGVITA ELYERLFHIP ALEKAFREEV TYWIEQYRRA KERLETVSRR ATALEDEVRR AREEVQKVGR EVASLETLLR EKDDEVAALR QALAYRDQLE EEALRAVYQE KLEEAISGLR RTVEKYRALL GEDKVAFESA VVKILEEYKI TQEKLARLEG MFPLIFLSWT PARVVLDKIY DVRVEAEIVN PLTPVTEVEI SLVPVEYRYM IQRYGMTEED YHKVFPREEV KTVKFRARGL IREVFSTVFE NLVGGREYVI KVVVRDLLNR TKSVEAKTPY IRQYENFAAS SRMNVGTYYY PWYDPAGTWL RYTLETPLLG QYSSRDPVVI SKHIDWASGH GINFLVVSWW GPDSFPDIVL RNYILTNSLI KDIKIVIFYE TLGRLKVKEA DQKIELDDEN KKTLLSDFAY LARYFAHPSY LRIDGKCIVV IYLARIFEGD VKGTLAEMRS SMQRVGCPIF IIGDVVYWHS PDRKMIKLYD AVTAYSMYTN IPQVLSDFED KVSWKYGEWS EATNALGVGF IPSAMPGFDD RAIRTGHIPL PKSTERFRKQ LIIARQYTNI NTILITTFNE WHENTNIEPS VKDGFSYLQV LKQVLLEGT
|
| |