Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1580 |
Symbol | |
ID | 5055458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1427925 |
End bp | 1430534 |
Gene Length | 2610 bp |
Protein Length | 869 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640469121 |
Product | hypothetical protein |
Protein accession | YP_001153786 |
Protein GI | 145591784 |
COG category | [R] General function prediction only |
COG ID | [COG2409] Predicted drug exporters of the RND superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000000620673 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGGTTGT TGCTCGCCGG CTTCGCCGTC ACGGTAGTGG TATACCTGGT GCTAGCATTA AACGCCCCTA AGGTGTTCGA AGTACTTGTA TATGACGAGT CTAAACTAAT GCCTCCAGAC ATAGAGCCTG AGGTAGTGAA CAGGATTGTG GCAAACACCA ACAAAGGCGA GGCGACGGTG CCTGTAGTAA TATTTGGCCC TCGCGTTGAA GAAAAGGCGA GGGAGCTATC TCGCTTGTAT CCAAACGCCA CTACGCCCTG GACAGTGCTA GACAAGGCGT TGGAAGCATA TTACAAAAAG GTAGGCGAAG TGGTAGACAA CGCCACGGCG AGGTTCAGGG AGGCGGCGCT TGAAATAGCG CGCTATACTA ATTCCACATG CGGCGACTTG GAGAGGCTTG CCGACGCCTA TAGAAAGGCG AGGGAGGAGG CGAGGCGTCT GCTTTTGGCT ACCTACGGCG TGGCGGCGTA TGGAAAGGCT GTAGACAACT CCACTGCGCA GTTTCTCCAG CTATACGAGG AGTATGTCAA GACATACGAT GTGGACACAG CGGTGAGGCT CGCGGCTGAC AAGGCGTATG GCAACGTGTC CAACCTGCTG GCTAACGTGA CTTGGAAGAA CTGGGCAACT GATAATGCGG TAGACAGTGT AGCTGTCGCG ATACTATCGG CCAGGCTTAA CAAGACGCTG ATAGACACGG CGCGTGCGGT GAGCGTGTTA GGTGCAAGGC AGTATTTGTA TGTGGCGTTG CTAAACAGCA CGCCTGCCAT GCTCCGCCCC TATCTACCCC TCTTGGTGTG TGGTGGGGAT GTAGAAAAAG CGGTGGGCAC ATTCCGCAAC GCGTTGTGGA GAAACATCAC TTCGCACAAC CCCCCACCTA CTCTTTACTC TTTGCCTCTA GTCTCAGAGC TGGTCTACCG GGATCAATAT GCAATTGCGG TGGTGAAAAC AAGCGGAGGC CAGCCCACAA TACCGGCGGA CATCGGCGTG CCGGTCTCGA CCTCTCAGCT TCTCAAAAGC TTCCAAGAAG TTGTGACCCA AGATGTTTCG CTGATTGATA AAACCACTGC GGCCGCGCTG TTTTTAGTTC TCTTGACTGT GCTGGGCACC ATACTGACCC CCCTCATAAT TGTGTCAACA GTGGGATTAA CATACCTCGC GTTGCTCGGC TTTATTTTTC ACATACGCGA CGTCCAGCCC ATATACTACC TCACGGTGTA CATGGCCGCG CCGGTGGTCT TCGCAATAGG GGTGGACTAT ATGCTACTAA TGTCGGGGAG ATACGCCGAG GAGCGCGCAC ACGGCAACGA CAAACTTTCC GCAGTCTCCA CTGTTAGGAA GTACGCCAAC AGGGCCATAG CCGCAAGCGC AGCTGTGGTG GCAACTTCGC TGGGCTCCTT TGCTGTGTCT CCACTTCCCT TTATGCAGTC TATTGGGATA GGCTATCTCA TAACAACGGC ATTTGTCGTG GTATCGGTGT TTGTAATTTT CCCATCGCTC CTCCTCATAT TAGGCGACAA GATCTTCTGG CCTAGGAAGA CTATAAGCGC CCACGAGGGG AGATCTAAAT TTCTGGAAGT TGCGGTGTCA ACAGCGCTTA GGAGACCTCT CCTCACTACC GTAGTTGCAG TGCTTATTAC CCTCCTCTCT TTTATGTTCC TCATCACCAC GCTTAAGGTC ACCACCAATC CCGTAGTGGC GATGCCCGAG ACGCCGTATA AGAAAGCCCT TGAAGTAGCG ACTACTTACT TCTCCAACAT AACCGCTTTA TCCACGACTT ACATAGCCAT GCGCAAGCCC CCGCCCGGCG GCCTTTTAGC AGAGATAGAA AAACTCCCTC ACTTTGTCAA CTACACTGTA GACCAGCGGG GGGAGTGGTA CGTCGTTTCT ATAAAGCTCT CCGTGGAAGA CACCTCCGAC GAGCTCCTCT ACATCTACCA CAGGCTTGAC GAACTGCGCC GCGTCTACGG CCCATTTTTG ATAGGAGGTG CGGCTTCCTG GAAAAATGTA ATATTCAGCG AGATATATGT ACGATTTTGG AATCTTCAAG TCTACATAAT AATAGCGTTG GCGTTCCTAA TACTCTCATT CCTCCTGAGG AGCTTCCTCA TCCCCGCCCG CCTACTGGCA ACTGTGTTGA TGTCAATATC CTGGTCGCTG GCCCTCGAGG TTCTGATATT TCAGGAGGCT ATGGGAAAGC CCACATATTG GCTTGTGCCT GTGATACTGT TCGCGTTTCT CATCGCCATA GGTACTGATT ACGATATCTT CATCGTCGCG AGAATTCGGG AAGAGCTAGA ACGCGGCCTA GGAGAGAGAG ACGCGATAAA GAGGGCGATT GTGGCCACTG GGCCTATAGT GACCGGGGCC GCCATGATCC TCGCGGCTGC TTTTAGCACT CTCCTCCTCT CTCAGACGCT CGTGCTGAGG CAGGTGGGAT TCACAATTGC ATTAGCGGCG CTGATAGACG CCTTTATTAT CAGGCCGCTG GTGGTGCCTG CAATGATGGT ATTGGCTGGC CGATATAACT GGCTGTGGAT TGGAGGATAC AGCATTAACA CTTATAAACA GAGCGCAGAT CCAGTAGGTG CGGATAGTCG ACGTTCTTAA
|
Protein sequence | MRLLLAGFAV TVVVYLVLAL NAPKVFEVLV YDESKLMPPD IEPEVVNRIV ANTNKGEATV PVVIFGPRVE EKARELSRLY PNATTPWTVL DKALEAYYKK VGEVVDNATA RFREAALEIA RYTNSTCGDL ERLADAYRKA REEARRLLLA TYGVAAYGKA VDNSTAQFLQ LYEEYVKTYD VDTAVRLAAD KAYGNVSNLL ANVTWKNWAT DNAVDSVAVA ILSARLNKTL IDTARAVSVL GARQYLYVAL LNSTPAMLRP YLPLLVCGGD VEKAVGTFRN ALWRNITSHN PPPTLYSLPL VSELVYRDQY AIAVVKTSGG QPTIPADIGV PVSTSQLLKS FQEVVTQDVS LIDKTTAAAL FLVLLTVLGT ILTPLIIVST VGLTYLALLG FIFHIRDVQP IYYLTVYMAA PVVFAIGVDY MLLMSGRYAE ERAHGNDKLS AVSTVRKYAN RAIAASAAVV ATSLGSFAVS PLPFMQSIGI GYLITTAFVV VSVFVIFPSL LLILGDKIFW PRKTISAHEG RSKFLEVAVS TALRRPLLTT VVAVLITLLS FMFLITTLKV TTNPVVAMPE TPYKKALEVA TTYFSNITAL STTYIAMRKP PPGGLLAEIE KLPHFVNYTV DQRGEWYVVS IKLSVEDTSD ELLYIYHRLD ELRRVYGPFL IGGAASWKNV IFSEIYVRFW NLQVYIIIAL AFLILSFLLR SFLIPARLLA TVLMSISWSL ALEVLIFQEA MGKPTYWLVP VILFAFLIAI GTDYDIFIVA RIREELERGL GERDAIKRAI VATGPIVTGA AMILAAAFST LLLSQTLVLR QVGFTIALAA LIDAFIIRPL VVPAMMVLAG RYNWLWIGGY SINTYKQSAD PVGADSRRS
|
| |