Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0394 |
Symbol | |
ID | 5054362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 344178 |
End bp | 345410 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640467961 |
Product | hypothetical protein |
Protein accession | YP_001152648 |
Protein GI | 145590646 |
COG category | [S] Function unknown |
COG ID | [COG1602] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGTGT TTTTATACCA CAGAGCTGTG GGTTTTGTGA GAGGCGACCT CTGCGTCAAG TGCCGCGGCG GGCGTTATCT CTGCGGGTTG TCTTACTGCC CGTTGTTGGT GAGGCAAGCC GCGGCGCCAT TTAGACAGCC GCCGCCTAAG GAGCTGTACG GCTCCAGCCC CCCTTCCGTA TTTGTCGGCA GGATGGGGTA TCCCAAGGTG AGGCTCTACC CATCATCGCC GCCGGAGGTT GGAGACACGA CGCCTTATGA AAACCCCGGG GAGTGGCTTC ACATGTCTCT AGAGCGCTTC CTCGCAATGA GGCTCTCCTT GTACAGAGGA GCCGTCGTGC TTAGAGTTGA AGACGCGGCG AGGCCCCCCA GGTTGCTTCA AGACGTCCAG TTGTTAGCCC TCTCACAAAG GCCAGTTGAG GTGTATCTAC AATTCCGCAA GCCGCCCAGA GGCGTGCATT TCAGCGAATA TTCGCCGCCC ATGGGTCCCT CTGCGCCCGC AGAGAGGGTA GAGGTCGAGG GAACACCCGC CCTGCCCAGA GCCGCCGAAA AGGCCTACTC AGACGTAGAC CTAAAGGCGG CGGAGGCCGT GGTGGAGTTG TACAGACACG GCCTAGAGGT GGCATACATT TCAAGGGCGC TAAGCGTCGG CGCCCTTGGG GGGAGGCGAA GGAGGCTTGT CCCCACGCGC TGGGCGATCA CAGCCGTAGA TAAAATCATT TCAGACCACC TTGTAGAGAA GGTGAAGGAT TATCCCGAGG TAGACGGCTA CTACCTATAC GCCAGGAGGA CGGTGGGGAA CCTCTTCATA GCCATACTGG CGCCGTCTAA GTGGGCGTAC GAGTGGGGGG AGGCCTTTGA GCCTCGCACG GTGTGGAACC CCGGCGGGTC GGTCGAGATG GAGCTGGACT ACGAGCTCTA CGGCGGCCGC CGAGACTACC CGGAAATCGG CGGTTGCTAC TACGCCGCCC GGCTCGCCAC TGCTGAGGCC CTTATGCGGA TGAGGAGACA AGCCGCTGCG ATACTCTGGC GAGAGGTCTA CACAGGCTTC ACCACACCAA CGGGGGTCTG GTGGGTGAGA GAAAACGTGA GGGCGATGTT TAAAGACGAG CCCGCTCGGT TTGACACACT GGAGGAGGCC CTCGAGGCTG CGTCCTACCT CTTGAAAATC CCAATGGAGA GGTGGTTAAC CATGTCGAGA ATAGTGCACC TACTCAAAAA CAGGCTGGTG TAA
|
Protein sequence | MMVFLYHRAV GFVRGDLCVK CRGGRYLCGL SYCPLLVRQA AAPFRQPPPK ELYGSSPPSV FVGRMGYPKV RLYPSSPPEV GDTTPYENPG EWLHMSLERF LAMRLSLYRG AVVLRVEDAA RPPRLLQDVQ LLALSQRPVE VYLQFRKPPR GVHFSEYSPP MGPSAPAERV EVEGTPALPR AAEKAYSDVD LKAAEAVVEL YRHGLEVAYI SRALSVGALG GRRRRLVPTR WAITAVDKII SDHLVEKVKD YPEVDGYYLY ARRTVGNLFI AILAPSKWAY EWGEAFEPRT VWNPGGSVEM ELDYELYGGR RDYPEIGGCY YAARLATAEA LMRMRRQAAA ILWREVYTGF TTPTGVWWVR ENVRAMFKDE PARFDTLEEA LEAASYLLKI PMERWLTMSR IVHLLKNRLV
|
| |