Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1431 |
Symbol | |
ID | 5054996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1289422 |
End bp | 1290558 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640468972 |
Product | hypothetical protein |
Protein accession | YP_001153641 |
Protein GI | 145591639 |
COG category | [S] Function unknown |
COG ID | [COG3372] Uncharacterized conserved protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.813641 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00133471 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGATACCAA TAGACTACCT CCGAGTGGCG AAGAGGGGAA GGGAGGTAAA GCCAAAGTAC CTACGCGACG AGAGGGCGGC GGCAGAGGTG ATCCAGGTAG CCAAGTCCGC TAAGACAGTT GGGCAGTTTA GAAAGGCGGC TGAGGCGACG AGCCCCGACA AAAAGCTCTC AAGAGGGTTG GCGCATCTAC TAGAGCGGCA TATGGAGTTG GAGAAACTCG ATGCCAAGCT AGTCGCCAGG GTGAGGATGG AGGTTTTCAA AACAGCATCG CAAAGAGGCT ACCCCCTAAC GCCTGAAGAG AGGGAAGAAA TTTTTCGCCT CGTAGCGTCA AGGCTGGGTC TAGGCGTTGA CGAGGTGAAG AAGTTATTCC TAAAGGCCTA CGAAGAGAAT AGGGAAATCG TAAAGCTCCC AGACATCGGA CCCGATGACT TGACAAGACT TTACAACCTG GCGCTTATAC AGGCACTTCT TTTCAAGTCG CTATACGTAA AGGCCAAGCT CCCCAACTCG CCTACTCACA TCAAGAGCTT GATAAGAGCC GTCAAGGGCT ACCGCCTAAT GTATATTGCT GAGGCTAGAG GCCAATTCCT TGAGTTCGCC TTCGACGGCC CTGTGTCCGC GTTGCGGCAG ACCGAGAGGT ACGGGACTAG GCTAGCGAAA CTCGTGCCAT ATATCACCTC GGCTGATCGG TGGGAGATAG AGGCGGAGGT GAAGGTGGGG GAGAGGAAGT ACCTTTTTAG AGAAAGCTGG GAGACCGCCC CGCCCCTGCC CAAAGTTGCC GTAGAGGCGG AGGAGTTTGA CAGCTCGATA GAGCTGGAGT TCTACCGACA GGTTTCGAGG CTCTGCAAAG TGGAGAGGGA GCCCGAGGCT ATTGTGGTAG ACGGGAGGAT ATACATCCCC GACTTCAAAA TAGGCGACCT ATACGTGGAG ATAGTCGGCT TCTGGACGCC GGACTACCTC AAACGAAAAT ACGAGAAGGT GGTGAAGGCG GGGAAGCCCA TCTTAGTCCT CGTGGCGGAG GAGTTAGCAA TGGCCACTTG GAAGGAGCTT ATGCCAAACG TAGTGATTTT TAAAGGGAGG CCGAGGCTAA GCGATGTGTA TAAGTACATA AAGCCTTACT GCTCTGCCAG GCGTTAG
|
Protein sequence | MIPIDYLRVA KRGREVKPKY LRDERAAAEV IQVAKSAKTV GQFRKAAEAT SPDKKLSRGL AHLLERHMEL EKLDAKLVAR VRMEVFKTAS QRGYPLTPEE REEIFRLVAS RLGLGVDEVK KLFLKAYEEN REIVKLPDIG PDDLTRLYNL ALIQALLFKS LYVKAKLPNS PTHIKSLIRA VKGYRLMYIA EARGQFLEFA FDGPVSALRQ TERYGTRLAK LVPYITSADR WEIEAEVKVG ERKYLFRESW ETAPPLPKVA VEAEEFDSSI ELEFYRQVSR LCKVEREPEA IVVDGRIYIP DFKIGDLYVE IVGFWTPDYL KRKYEKVVKA GKPILVLVAE ELAMATWKEL MPNVVIFKGR PRLSDVYKYI KPYCSARR
|
| |