Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2334 |
Symbol | |
ID | 5054443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 2087323 |
End bp | 2089080 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640469886 |
Product | hypothetical protein |
Protein accession | YP_001154530 |
Protein GI | 145592528 |
COG category | [S] Function unknown |
COG ID | [COG2433] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0927872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00309238 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCTATCC TGGGCATTGA CATTGCGCCC GGCGGCTTGT TTGCCTACGC CGTCGTGGAT AACGACGTCG TGGTGGAGAA GGGCACCGCT AGCGCCAGAG ATCTAGCATC CGTTTTTAAG AAGTATAGAA TTCAAAAGCT GGCCTTGGAC AATCTGGGGG AACTGTTTCA ATACGGCAGA TCTGTGATAA GACTTCTCGG TAAGCTTCCA TATGACGTAA ACGTCGTCGA GGTCACTAGG GTTGGAGAGG GGTATGTTAG AACAGAAGAC TTGGTGAGGC AACATCTCGG GGTAGTGAAG GGGAGGCTGG ATCCGCAGGA GACGGCGATA TACCTAGCCA TGTTAGCCGG ACGCGGAGTG GGGACACCTG TAAAACTCTT CGAGGAGGAG ACCGTGGTGC TTGTCTACAG GCGCATTTCG ACGACCCCCG GCGGTATGAG CAGGAACAGG TACATGAGAA ACATAAGCCA CAGGATAAGA GATATAGCGG CAAGAATTGA GGCAAAGCTA AAAGAGGCCA AGTTAGACTA CGACTTATTC CTAAAAGAGG AGTCCGGCGA AGTCACCTCT GCCAAGTTCA TAGTTTATGC CAGCAAGGAG GTTGTTAGGA GGTATGTAAA GACCATGCGC AGTATCGACG TTGCAGTATC TATATACTCC GCTCCGGCTA AGAAAGGCGG AGTCCCCACC CACGGGCGCT ATCTAATCGT TGGAGTGGAT CCAGGTATAG TGACAGGTGT CGCAGTGCTG ACGCTAGACG GCGAAGTCCT CGACACCTTG GCTAGAAGAG GGTTCTCGCG GGGCGATGTG CTCAGGTACG TACACCAGTG GGGGGTGCCT GTGGTTGTTG CCACGGACGT AGCCGACCCC CCCGAATACG TGAAACGGTT GGCGTCTATG TGCGGCGCAG TGCTCTATGT GCCAAGCAGA GACCTCACGT CGGAGGAAAA GGCAGAGGTG TTAGAAAAAG TGGGCTGGAG GGCTAAGACA ACTCACGAGA GAGACGCCTT GGCGGCCGCG TTTAAGGCAT ATCAGGATTA TAAGCCGAAG TTTGAGAAAA TCGAAAAGGA ATTCGGAGGT ATACTAAAGC CCGACCAGCT TGAATATGCC AGGGCCCTCG TGGCCAAGGG CTACTCCATA GCCCAAGCCG TCTCCGAGGC CTTGAAGAGA CGTGAGGAGA AGGAGACCAA AGTTATCTAC GTAACTGTGG AAAAGCCCTG CGGTTCAAGA GACGAAGCTC TCACAGCTCG TATAAAAGCC CTCGAGTATG AAAACATGGA GTTGCAGAAA GAGCTTGAAA ATCTAAGGCG GGAATATGCG CAGCTAAAAA GAGCGTTTGA GGATGCTAAG TGGCGAGATA TGAAATACAG AGAGCTCCAG AACAGAATAG AGGCGCTTAC AGCGGCGCTG ACGCAGAAAG AGGATGAGAT AAACGCTTTG AAAAACTTGT TTCTGGAAAT ACTCAAAGCT TTCGGGACTC GGTATAAGCT ACTCCACCTA TCAGAGACTG TGGAGTGCAG AGGCGGCGAG GTTGTTGGCA CCGTCTGCAG AAATACAGAA ACTGTAGACG ACGCCGTGGC GCGAAAAACC TTAGGAGTCC CCTTGAGGCT TGTTGCAAAG TTGCAACTGG GAGAGTACTA CGTGATCGAT ATTGACGCCC TCAAGAGACT TACAGACGAA ATAAAGCGGC GCATCGAAGA GAGGCGGGAA ATCGACCTGA GAAAAATCGT GGAGCAGTAC CGCCGAGGGC TAGTATAG
|
Protein sequence | MAILGIDIAP GGLFAYAVVD NDVVVEKGTA SARDLASVFK KYRIQKLALD NLGELFQYGR SVIRLLGKLP YDVNVVEVTR VGEGYVRTED LVRQHLGVVK GRLDPQETAI YLAMLAGRGV GTPVKLFEEE TVVLVYRRIS TTPGGMSRNR YMRNISHRIR DIAARIEAKL KEAKLDYDLF LKEESGEVTS AKFIVYASKE VVRRYVKTMR SIDVAVSIYS APAKKGGVPT HGRYLIVGVD PGIVTGVAVL TLDGEVLDTL ARRGFSRGDV LRYVHQWGVP VVVATDVADP PEYVKRLASM CGAVLYVPSR DLTSEEKAEV LEKVGWRAKT THERDALAAA FKAYQDYKPK FEKIEKEFGG ILKPDQLEYA RALVAKGYSI AQAVSEALKR REEKETKVIY VTVEKPCGSR DEALTARIKA LEYENMELQK ELENLRREYA QLKRAFEDAK WRDMKYRELQ NRIEALTAAL TQKEDEINAL KNLFLEILKA FGTRYKLLHL SETVECRGGE VVGTVCRNTE TVDDAVARKT LGVPLRLVAK LQLGEYYVID IDALKRLTDE IKRRIEERRE IDLRKIVEQY RRGLV
|
| |