Gene Information |
Locus tag | Pars_1069 |
Symbol | |
ID | 5055376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 952458 |
End bp | 953558 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640468625 |
Product | hypothetical protein |
Protein accession | YP_001153299 |
Protein GI | 145591297 |
COG category | [S] Function unknown |
COG ID | [COG1602] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
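The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length (neither convention is stated in the record). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 952458
end_bp = 953558

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; a remainder of 0 means a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = codons - 1

print(gene_length, codons, remainder, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (1101 bp) and Protein Length (366 aa).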
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTGCG GTCTGCCGAG GTGCCCCATC GAGGAGAGGA TTAGGGCCGT GAAGTCCTCC CTCCTCAAGA TCCGGGGCCG GGAGGTCTTC GGCGCCACGC CCCCCAGCGC AGTTGTCGGC GAGGCTGGGT GGCCGCGGGT GAGAGTCTAC ATCGGTGAGC CCCCCGAGGT GACAGGCGAG GAGGCCAGAG CTTACGACGA CCCGCGGCTG CTCTGGGGGA GGGAGCTGGA GGAGATTCTA AGGCTCAGGA GCTACATGGT ATTCGGCTAC GCGGCCCAGA CAAGCCCGCG GAAGCTCGGC GAGTTGCCCC TCCTCGCCGT CTCCGAGAGG CCGGTGGACG TGGAGATGCG CCTAGCCAAA ACCCCTGTGG AGAGCCTCAA GTTCGACCTA AGGGAAAAGC CCATGGGCCC CAGGGCGCCT CTCGAGGCCT TGAGGATAGA CGGCAACCCG GCGGTGCCGC GGGCGTTGGA CAAGCTGATG TCCGACGATC TGGGCGCCGG GGCTGCCGCC GTTGAGCTGT ACAGAAGGGG TGTCGACCTC TACACTATCC AGAGGGCCTT CGCCCTAGGC CTACTCGGGG CGAGACACAG GCGGAGGCTC GTCCCGACGC GGTGGAGCAT AACCGCAGTC GACGTGGCTA TCGGCGATGC CCTGGCGCAA CAGGTTAGGC ATATGCCGGA GGTTTCACAA CCCCTATACG GATACGCCGA GTATCTAGAC AACCGCTACC TGGTCGCCGT GGTCCCCGGC CCGCTTAGGT TCTACTACCT GGAGAGGTGG ACATATGCAG GAAGAGTCGC CGAGATAGAG GTGGCGGAGG ACCCCCGGGG AGTGCGAAGC ACCATGGACG GCGGCTACGA AGCCGCCAGG CTGGCGATAC TGGAGAAGCT GGCCTCAATG GGCAGACGAG GCACTGTGTC AATAGTGAGG TGGATAGGCG AGAGGTACTA CGTCTCGGTG GGCAACTGGC AGATAAGAGA AACCCTACGC AGACTCCAGC TGAAGCCGCT AGACGAAAAC TACAAGACAT ACACCGCGCT GGTAGGGAAA GACCCGATCT CACTCATAAA AAATACTAAG AGACTAGACG AGTTTCTCTA A
|
Protein sequence | MLCGLPRCPI EERIRAVKSS LLKIRGREVF GATPPSAVVG EAGWPRVRVY IGEPPEVTGE EARAYDDPRL LWGRELEEIL RLRSYMVFGY AAQTSPRKLG ELPLLAVSER PVDVEMRLAK TPVESLKFDL REKPMGPRAP LEALRIDGNP AVPRALDKLM SDDLGAGAAA VELYRRGVDL YTIQRAFALG LLGARHRRRL VPTRWSITAV DVAIGDALAQ QVRHMPEVSQ PLYGYAEYLD NRYLVAVVPG PLRFYYLERW TYAGRVAEIE VAEDPRGVRS TMDGGYEAAR LAILEKLASM GRRGTVSIVR WIGERYYVSV GNWQIRETLR RLQLKPLDEN YKTYTALVGK DPISLIKNTK RLDEFL
|
| |
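The gene and protein sequence fields can be compared by translating the gene sequence. The following is a minimal sketch, assuming that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (it differs in the start codons it permits) and that the spaces in the sequence fields are display grouping only. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two display groups of the Gene sequence field; the trailing "AG" is an
# incomplete codon and is ignored.
prefix = translate("ATGTTGTGCG GTCTGCCGAG")
print(prefix)  # MLCGLP

# Last twelve bases of the Gene sequence field, ending in the stop codon TAA.
suffix = translate("GAGTTTCTCTAA")
print(suffix)  # EFL
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field would extend the check to the whole record; `gc_fraction` on the same field can be compared with the listed GC content.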