Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1236 |
Symbol | |
ID | 5055439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1118647 |
End bp | 1119693 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640468782 |
Product | hypothetical protein |
Protein accession | YP_001153455 |
Protein GI | 145591453 |
COG category | [S] Function unknown |
COG ID | [COG2855] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGACTA AGTGGACTGA TCCAGCAAAG GCCTTCGCGG TGCCGAGCCC AAAGCTTGTG TTCGGGAGCC CGTGGGTAAA TATCATAGTT GCGTGGGTTG TCTTAATGCT CCTTCTGATG GGCCCCGCAA AGCTAATCGG CGTCAAGCCG AGCGACTGGG CCAAGGGCTT TTCTGTAATC TGGTGGCTGT GGTGGCTTTC CGCATTTATT GTCGGGTATA AGCCCATAGC TGACGTTGTG ACCACAGAAT TTGCCTTCAC CCTAGCCCTC TTCCTTGGGA TGCTCATAGG CAATTTGCCT AAGGTGCATC AGTGGTTGCT CGGATCGGCG AGGGGGGAGT GGTTTATAAA AACGGCAATT GTGCTCCTAG GCGCCAAGAT CCTCTTCACA GACTGGATTA GATACGGAGG CTCTGTTCTC GTCATGGTGC TTATGTCCTT CCCAGTGTTT ATGCTCTTGG CATTCCCCGT GTTCAGGCTC TTCACCAAGA ATACCGATCT AAGCATCGTG GCTTCCGTAG GCATAGGCGT GTGCGGCGTG TCGGCGTCTA TCACAGCGGC CGGCGCCATT GGGGTCCCCG CTATTTACCC CACAGTGGTG TCAGCCGCAA TCCTGATATA CGCGGCGGTT GAGCTCATCA TCTTGCCGTA CGTGGCGCAG TGGCTGGTTA AGGCAGGGAT AATGAGCCCT GCTACCGCAG GGGCGTGGAT GGGCCTCTCT GTTAAGACCG ACGGGGCCGC CGCGGCGTCT GCTGAAATAG TCACCCGCTA CGTGGGGGTT GATGAGCCTC TACGCGTCGG CGTAATGGCC AAGGTCTTAA TTGATATCTG GATGGGGGTG ATCGCCTTTG TCCTCGCCTT GATATGGGTG TTCGTTGTAG AGGTTAGGCG CGGAGTCGCG AGCGGCCGCA GGCCCTCGCC GATGGAGCTC TGGTATAGAT TCCCCAAGTT TGTCCTAGGC TACTTCTTCA CATCGCTGGT AATTTCAGCA CTTATTATGA GCTTAGCCGG CTCTGTATAC GCCACTGCCC CGAACCCCGT CGACTAG
|
Protein sequence | MWTKWTDPAK AFAVPSPKLV FGSPWVNIIV AWVVLMLLLM GPAKLIGVKP SDWAKGFSVI WWLWWLSAFI VGYKPIADVV TTEFAFTLAL FLGMLIGNLP KVHQWLLGSA RGEWFIKTAI VLLGAKILFT DWIRYGGSVL VMVLMSFPVF MLLAFPVFRL FTKNTDLSIV ASVGIGVCGV SASITAAGAI GVPAIYPTVV SAAILIYAAV ELIILPYVAQ WLVKAGIMSP ATAGAWMGLS VKTDGAAAAS AEIVTRYVGV DEPLRVGVMA KVLIDIWMGV IAFVLALIWV FVVEVRRGVA SGRRPSPMEL WYRFPKFVLG YFFTSLVISA LIMSLAGSVY ATAPNPVD
|
| |