Gene Information |
Locus tag | Pars_1543 |
Symbol | |
ID | 5054172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1398677 |
End bp | 1399825 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640469084 |
Product | major facilitator transporter |
Protein accession | YP_001153749 |
Protein GI | 145591747 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
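The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the stop codon is included in the gene length but not in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Pars_1543).
length = gene_length_bp(1398677, 1399825)
print(length)                     # 1149
print(protein_length_aa(length))  # 382
```

Both results match the Gene Length (1149 bp) and Protein Length (382 aa) fields.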
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0259552 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGGG GGGAGTTTAA AATTGTAGTG CTGAGTGGCC TCGGCTGGAT GTTCGACGCC ATGGACGTCT TGATACTCTC ATACTTACTC GTGGCGATGA GAGAAGAGCT GGCGCTGGAT CGCGCGGCAT CGACGTGGAT CGTTCTGGCA AATAACTTAG GCATGTTCCT CGGCGCCTTC CTCTTCGGGA AGCTCGCCGA CGTCGTGGGG AGGAAAAAGG TGTTTATGGC CACTATGTTG CTTTACAGCA TTGCCACCGC GGCGTCCGCC GCCGCTAGGA CGTGGCAGGA GTTCGCCGCA ATTAGGTTCT TCGTGGGAGT TGGGCTAGGC GGCGAGTTGC CCGTGGTGGC CACGTACGTC TCGGAGAACT CCCCACCTGA GAGGAGGGGG AGAAATGTGG TTCTCCTAGA GAGCTTCTGG TCGATAGGCG CTCTCCTCGC CGCCGCCGTG TCGCTCTTTA TCTTCACCAC ATTAGGGTGG AGGACGGCGC TTGTGTTGAT GGGGGCCACA GCCTTCTACG TCTTCGTAAT ACGCTCCGCC CTCCCGGAGT CGCAGAGGTG GCTGGAGAGG ATCAAAGAGG GAGCCTCGGC GGAGCTTAAG CCTTACGCCG CGAGACTCGC CATAGCTTCA GCCATTTGGT TCCTCCTAGC CTTTGGCTAC TACGGCGCGT TTATCTGGTT GCCCACAATG CTCAGGACAG AGAGAGGCTT CACACAGGTG GCCACCTACG AGTTCATGTT TTTGACAACC ATCGCCCAGC TCCCGGGCTA CTTCTCAGCG GCATACCTCG TGGAGAGAGT GGGCAGGAGG CCAATAGCGG CGGCGTACTT CGTAGCCTCG GCTCTATCTG CGGTTTTGCT GATATACAGC ACGTCGTACG CCCAGCTCTT CTACGCGGCC CTCGCACTCA ACTTCTTCAA CCTCGGGGTC TGGGGTGTCG TGTACGCATA CACCCCCGAG CTTTTCCCCA CTTCTATAAG GGGCCTTGCG ACAGGTCTAG CGGGCTCAGC CGCAAGGATC GGAATGATTA TTGGACCTAC GCTGTATCCG CTTTGGGCCT CCGTAGCATT CATAGGCGTC GCAGTTGCGT GGCTAATAGC GTCAGCCCTA GTAGCGCTTT TGCCCGAGAC AAAAGGCCGT GAGGTGTAG
|
Protein sequence | MTRGEFKIVV LSGLGWMFDA MDVLILSYLL VAMREELALD RAASTWIVLA NNLGMFLGAF LFGKLADVVG RKKVFMATML LYSIATAASA AARTWQEFAA IRFFVGVGLG GELPVVATYV SENSPPERRG RNVVLLESFW SIGALLAAAV SLFIFTTLGW RTALVLMGAT AFYVFVIRSA LPESQRWLER IKEGASAELK PYAARLAIAS AIWFLLAFGY YGAFIWLPTM LRTERGFTQV ATYEFMFLTT IAQLPGYFSA AYLVERVGRR PIAAAYFVAS ALSAVLLIYS TSYAQLFYAA LALNFFNLGV WGVVYAYTPE LFPTSIRGLA TGLAGSAARI GMIIGPTLYP LWASVAFIGV AVAWLIASAL VALLPETKGR EV
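The sequences above are printed in space-separated blocks of ten, so the spaces need stripping before any analysis. The following is a minimal sketch, using only the Python standard library, of how one might clean a sequence, compute its GC content, and translate it. It assumes the gene sequence is already in coding orientation (it begins with ATG even though the gene lies on the minus strand). It also relies on translation table 11 assigning the same amino acids to codons as the standard code, differing only in which codons may start translation. All function names are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and final nine bases of the gene sequence in this record.
head = clean_sequence("ATGACGCGGG GGGAGTTTAA AATTGTAGTG")
tail = clean_sequence("GAGGTGTAG")
print(translate(head))  # MTRGEFKIVV
print(translate(tail))  # EV
```

The two translated fragments agree with the first ten and last two residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_content` gives a value to compare against the GC content field.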