Gene Information |
Locus tag | Pars_0983 |
Symbol | |
ID | 5055103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 873968 |
End bp | 875818 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640468539 |
Product | hypothetical protein |
Protein accession | YP_001153215 |
Protein GI | 145591213 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
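The Start bp, End bp, Gene Length, and Protein Length fields above agree with one another if the coordinates are read as 1-based and inclusive and the last codon is a stop codon. A minimal Python check under those assumptions (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 873968
end_bp = 875818

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in a single stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1851 bp) and Protein Length (616 aa).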
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.495624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000164054 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAATAG GCTACGTCGT GGCCACGGCG ACGCCGTTTG AGTTCGTGGC GACGCTGGAT CCCGAGAGGC CTGTTAGTCT GTACGACTAC GTGGTGGTTG ACCACGTGGA GCTCGACAAC GCCTCCGGCG AGCTTGTAAA CGTCAGTTTA CTGGGCCAGA TAGTGAAGCT TTACCGCGAC CCCTACTCGG TGAAGAGGGA TCTGCCGCTC TACACCGTCA TACAGGAGGT CTCTAGTAAT ATTTTGGAAG TTCAGATTGC CAAGGTCAAG GTGCTTGGCT ATGTGCTAAA CGGTGAGTTG AGGCAGCCGA AGCAGCCGCC GAGGATAGGT TCGCCGGTCT ACTTGGCGGA GAACGAACAA ATCGCCGAGC TGTTTAAGGT GGAGAACGGG CTGTGTGTCG GCAAGCTTGC AAGCCGCGAT GTGGCTGTGT GTCTAGATAT AAACGGTATT AGGAGACACC TTGCGGTAAT TGCGGCGACG GGCAGTGGCA AGACTTGGTT TTCGGTGGTG TTGATAGAGG AGTTGCTGAG ACGAGGGGCT AAAATTGTGG TCATAGACCC ACACGGCGAA TACGTAGCAA TAAAAGACTC AATACACCGC CTAGGTCCCT TCACTGCGAG GGTTGTGAAG GTGTCGAAAC ACCACGTGGG GGACTTAATG TACAAGATAG GTGTTCTTGA CAGTGATCCA GAAGCGTTGG CAAACGCCGC GGGCGTACCG CCTGGCGCTA AGAAGATAAG ATATGCGATC TACCTCGCAT GGTCCTATGC GAAGAAGGTT AGGAAAGCCA CTGGGGAAAA AGTCGGCTTG GCCTTTATGA AAAGAGTCCT ATACACAGCC ATGAGGGGGG AAAACGCCTT GCAAAAACTT TTCCAGCAGT ACAAAGGAAT CAACGACGGC GCTCACAAGG CCGAGGGAGA TTTTCCCCTA AGCGATTTAA AGCAACTCGC CGCCAAGGAC AGACACGCCA TTTTCAGCGC GTTGACGTAT TTAAAAAAGC TGTCTAGGCT GGGAGTCTTC TCGTCTAGGT CAACCCCTCT CTCGAAGCTT CTGGGCGACA TTACGATTAT CAACCTGGCA GGGGTAAACG AGGAGGTCCA GGACTACGTG GTGTCGCACT TGGTGAATAG GCTCTTCCAA GCTAGGGTGA ACCACGTCAG GGGGTTAAAG GGGTACCAAC TCCCGTGGCC CATAGTCTTG TTCGTAGAGG AGGCTCACAG ATTCGCCCCT CCAAAGGCAC TAAGAAAGAC GAGGTCTTAC GAGGCCTTGT CCCGGGTCGC CTCAGAAGGG CGCAAGTTCG GCGCCTACCT CGTAATTATA AGCCAGAGGC CTAGCAAGGT CGATCCTGAC ATAATTAGCC AGTGCCAGAG CCAAGTAATA ATGCGGATAG TCAACCCCAA AGACCAAGAG GCGGTTAGAG AGAGTAGCGA ACTGTTGGCG CAGGAGTTTC TAGAAAACCT GCCCGGGCTG GACGTGGGCG AGGCTGTGGT GTTGGGACCC ATCGTGAAAC TCCCCGTAGT GATAAAGGTG AGGGACAGGG TGCTTGAATA CGGCGGATCT GACATAGATC TCACAACGGC GTGGAAGGTG GATAAGACCG CCGACGTGGC GCAGATGTGG AGGAGGATAT TCAACAGCCC GCCTCCTCCA AGCGTTATGC TGTCGGCATC TAGAATGAGG CTACTCCACA AAAAGAGGGA GGGGAATAAA ATCGTCATTA AGCTCCTCGA CGGGGATAAG GAAGTGGACG TGGTAATCGA GGGCGGCTCC CCCCGCTGTA GTGTCTGCGG CGTCGGCAAG 
CCGTGTAGCC ACGTGTATAA GGCACTTGAA GAGGCGCTAG AGGTGGTATG A
Protein sequence | MRIGYVVATA TPFEFVATLD PERPVSLYDY VVVDHVELDN ASGELVNVSL LGQIVKLYRD PYSVKRDLPL YTVIQEVSSN ILEVQIAKVK VLGYVLNGEL RQPKQPPRIG SPVYLAENEQ IAELFKVENG LCVGKLASRD VAVCLDINGI RRHLAVIAAT GSGKTWFSVV LIEELLRRGA KIVVIDPHGE YVAIKDSIHR LGPFTARVVK VSKHHVGDLM YKIGVLDSDP EALANAAGVP PGAKKIRYAI YLAWSYAKKV RKATGEKVGL AFMKRVLYTA MRGENALQKL FQQYKGINDG AHKAEGDFPL SDLKQLAAKD RHAIFSALTY LKKLSRLGVF SSRSTPLSKL LGDITIINLA GVNEEVQDYV VSHLVNRLFQ ARVNHVRGLK GYQLPWPIVL FVEEAHRFAP PKALRKTRSY EALSRVASEG RKFGAYLVII SQRPSKVDPD IISQCQSQVI MRIVNPKDQE AVRESSELLA QEFLENLPGL DVGEAVVLGP IVKLPVVIKV RDRVLEYGGS DIDLTTAWKV DKTADVAQMW RRIFNSPPPP SVMLSASRMR LLHKKREGNK IVIKLLDGDK EVDVVIEGGS PRCSVCGVGK PCSHVYKALE EALEVV
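The gene and protein sequences above can be spot-checked against each other by translating the nucleotide blocks. The sketch below is a hedged illustration, not the method the database uses: the helper name `translate` is hypothetical, and the codon table is built by hand on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits. Only the first three and the last six whitespace-separated blocks of the gene sequence are used.

```python
from itertools import product

# Codon table in TCAG order (first base, then second, then third).
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_dna: str) -> str:
    """Translate DNA written in whitespace-separated blocks; '*' marks a stop."""
    dna = "".join(grouped_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases and last 51 bases of the gene sequence in this record.
head = translate("ATGAGAATAG GCTACGTCGT GGCCACGGCG")
tail = translate("CCGTGTAGCC ACGTGTATAA GGCACTTGAA GAGGCGCTAG AGGTGGTATG A")

print(head)
print(tail)
```

The two fragments translate to `MRIGYVVATA` and `PCSHVYKALEEALEVV*`, which match the start and end of the listed protein sequence, with the trailing `*` corresponding to the terminal TGA stop codon.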