Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1765 |
Symbol | |
ID | 5054401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1585323 |
End bp | 1586738 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640469310 |
Product | major facilitator transporter |
Protein accession | YP_001153968 |
Protein GI | 145591966 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.719018 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.000452829 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGGATAGCT ACGATATTAG ATACGCCTGG AGAGCAACGC CACTTTTAGG CTCTGTCGCT CTTTTGGTAA TGTACACAGA GGCCATGTTG ATGCCGAGCC TCCCCAAGAT ACAGGCCGAG TTTAACGTCA CTCCCGCAGA CGCCTCTTGG ATTTTGACAA TATACCTAAT TTCTGGCACT ATAAGCGCCG CTATTTTTGG AAACCTCGGC GACATATACG GGAAGAAGAA GGTGCTTTCT ATCGTGATGG CAGCCTATGC GGTAGCGGTG ACCTTTACAG GCTATGCCCC GAATTTCGGA TCTTTGCTAC TCTCGCGTGC CATACAAGGC ATGGGAATGG CGATGTTCCC ACTGGCTTTT TCGCTTATCC GAGAAGAATT CCCGCCACAC ATGGTGCCCA CGGCCCAGGG AGTTGTAAGC GCGATGTTCG GTGTAGGTAT TATAATAGCG TTGCCCGTCG GAGCCTATAT AGCTCAGAAC TACGGGTGGA GAGCCACATA CCACACAGCG ACGCCAATAG CCGTCTTGCT CACCTACTTA ATAGTCACCT ACATAAGAGA GAGTCGGTAC AGAACGCCCA GGAAAATCGA CTTTGTAGGA GTCGCCCTCT TCTCCTCTAT GGCGGCGTCA TTTCTGCTCG CCATATCTAA AGGCCCAGAT TGGGGGTGGT TTTCGCCAAG GATCACCTCG TTGTTTATAC TTTCGGCTGT GTCTGCCGCC GTTTTTGTAA TCCACGAACT GATCACAGAC AGCCCCTTCA TACCGAGAGA TATCTTTAAC AGAAACGTAA TAGCGGCGAC GATCGCAATT CTAATAGTGG CATACGCGTT TCAGATGAAT TCCCAAAATT TGTCGTACCT ATTCCAGATG CCGCCGCCTT ACGGCTATGG GCTAACAATT CTGCAGACCG GTCTCTACAT GTTGCCTCCA GCTATGGTTC AGATAATTGT CGCCCCGCTT TCTGGTAGAT TAATGTGGAG GCTTGGGGCA AAGAGAATTG CTTCACTCGG CGTTGTTTTC GCCGTGGTTG GCTACCAGCT AGCCGCCGCA CACCTCTACA GCGGCGTATG GACGCTAATC TCATACATGA CTCTGGGCTT TGTAGGATTG ACCTTGTTAA ACGTCTCACT TATAAATCTC CTCACGTTCT CTGTGCCTAG AGAGAGACTG GGCGCCGCCA CCGGCCTCAA CACTGTTTTT CGCAATTTTG GCTCAGCTAT CGCTCCCACC GTTGCAGGTA CAGTATTGAC AAACTTTAAT ACCTATATCT ACTACAATAC ACCAGTGGGA TTGGTCTACT TCTCTGTGCC TTCAAAAGAG GCGTATATAA TAAACATCGA CATTGCCACC ATTATGTTTA TCATATCGCT TGTACCAATA TTAATATCAA AAGAAATTCT AAGACTGAAC AAGTAG
|
Protein sequence | MDSYDIRYAW RATPLLGSVA LLVMYTEAML MPSLPKIQAE FNVTPADASW ILTIYLISGT ISAAIFGNLG DIYGKKKVLS IVMAAYAVAV TFTGYAPNFG SLLLSRAIQG MGMAMFPLAF SLIREEFPPH MVPTAQGVVS AMFGVGIIIA LPVGAYIAQN YGWRATYHTA TPIAVLLTYL IVTYIRESRY RTPRKIDFVG VALFSSMAAS FLLAISKGPD WGWFSPRITS LFILSAVSAA VFVIHELITD SPFIPRDIFN RNVIAATIAI LIVAYAFQMN SQNLSYLFQM PPPYGYGLTI LQTGLYMLPP AMVQIIVAPL SGRLMWRLGA KRIASLGVVF AVVGYQLAAA HLYSGVWTLI SYMTLGFVGL TLLNVSLINL LTFSVPRERL GAATGLNTVF RNFGSAIAPT VAGTVLTNFN TYIYYNTPVG LVYFSVPSKE AYIINIDIAT IMFIISLVPI LISKEILRLN K
|
| |