Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1167 |
Symbol | |
ID | 5054906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1054060 |
End bp | 1055379 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640468717 |
Product | hypothetical protein |
Protein accession | YP_001153390 |
Protein GI | 145591388 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.018342 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGTCC CAATATCTAC AATTATATTG TTGGCGGCTC TTGTAGCCGC CCAACAAATA TATGTAAATG TCTTGGCAGA CCTCTCCCAC GGCGAGGCGG AGAAGGGCCT CGACTTCTGG GTTAACTCCA CGACAAACCC CCTAGCGATA TCGGACTTCG CTAAGCTGTA CATCTTGTTG CCCCCCGGCG CCAAGCCGGG CCCAGTCGTC GCCAAACTAA ACGCCACGAA GTCGGCAGTC CTCCTCACAG GCGACCTATC CACAGTAGAC CTGAACCAGT TTAAGGTAAT TGTGCTGGGC CAGCCGACGA AGCCTCTGTC AGACGCCGAG TTGGCTGCTG TTAAGAAGTG GCTCGACGGA GGCGGGAGGG TGCTGTGGTG CGCCGCCGAC TCCGACTACC CGGCTCAAGG CTCCGAGGAG TCGCAAGTGG CGTGCAACGA CATCGCCGAA TACCTCGGCG CCCACATCCG CGTCGACTAC GTGTCTGTAG AAGACTCCCA ACACAACGCC GGGGCTGGCT ACAGAGTTGT CGGCGTGGTC GACCCGCCGC CTCAGCTCTC CTTCCTCGGC TTCATGGCCC AGAGAGTGCT CTTCCATGGC CCAGGCGCTG TTGCTGTCGT CCTCCCTGAC GGGAAGTGGA TCCCCGCCAC GAGCCCCGAG GTGCAGAAGT ACTACAATAA CGTATTCGTA ATTGTACATA CAACCCCCAC CGGCACCATT GTGGAGAGCA GAACCTCAGC CGACGGAAAG GGGAGAGATG GAAAGGCCCA CACGGCGGGA GACAAGGGCG TGTTTGCCCT TATGGCTCTT GAGCTCCTGC CCAGCGGTAG CGTCTTGATA CTATCCGGTG AAACTCCATA CGGTGGCTAC GAGCCCATGC TGGCCCCTGT GTACTACAGA GTGCCCCTAG ACGGGCCGAG GTTTTTCAGA AACTTGATGC TTTGGGCGAC TGGGAACTAC CGCGAGCTCA CCACCATGAT CCAGCAAACG CAACTCTTAT CTCAGTTGCA ACAGGGAATT GAAGCCGTTA GAGGCGACGT GTCTGCCCTA AAAGGCGGCG TCTCGGCCGT GCAGAACGAC TTGGCTTCCC TCAAGAACTC GGTGTCCCAG CTGGGCAACC AGATCAGCGC TGTTTCTCAG AACGTCGCAC CGGTAAAAGA CCAGGTAGCC GCCCTCAACC AAAAGGTGGA CCAGCTGACG CAACAGCTAA ACGCCGCTAT GGCCGAGGCC AACAACGCCA GGACGGTGGC CTTTGTCGGG ACGGCGTTGG CCCTTATATT CGCCATAGCC GCCGCCTTCC TTGCTGTGAG GAGGAAATGA
|
Protein sequence | MRVPISTIIL LAALVAAQQI YVNVLADLSH GEAEKGLDFW VNSTTNPLAI SDFAKLYILL PPGAKPGPVV AKLNATKSAV LLTGDLSTVD LNQFKVIVLG QPTKPLSDAE LAAVKKWLDG GGRVLWCAAD SDYPAQGSEE SQVACNDIAE YLGAHIRVDY VSVEDSQHNA GAGYRVVGVV DPPPQLSFLG FMAQRVLFHG PGAVAVVLPD GKWIPATSPE VQKYYNNVFV IVHTTPTGTI VESRTSADGK GRDGKAHTAG DKGVFALMAL ELLPSGSVLI LSGETPYGGY EPMLAPVYYR VPLDGPRFFR NLMLWATGNY RELTTMIQQT QLLSQLQQGI EAVRGDVSAL KGGVSAVQND LASLKNSVSQ LGNQISAVSQ NVAPVKDQVA ALNQKVDQLT QQLNAAMAEA NNARTVAFVG TALALIFAIA AAFLAVRRK
|
| |