Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_1972 |
Symbol | |
ID | 4617903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | + |
Start bp | 1788513 |
End bp | 1789631 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639785063 |
Product | major facilitator transporter |
Protein accession | YP_931462 |
Protein GI | 119873455 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.106436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.000000141049 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTTCCC AATGGAAGTT GATAACTATA ACAGGCATAG GCTGGCTCTT CGACGCCATG GACGTCCTCC TCTTGTCGTA CATACTAGTG GCGTCGGCGG CTGAGCTAGG GATGGGGGTC TGGGAGAGAT CCGTGGTGGT ACTAGCGAAC AACCTCGGCA TGTTAATCGG CGCAACTGCC TTTGGGAGAC TGTCAGACAG GCTTGGCAGA AGGGCCGTCT TCACAGCGAC GCTCCTCCTC TACAGCCTTG CCACGGCTGC CACAGCCTTT GTAAAAACCG GATGGGAGCT TGCAGCTGTG CGTCTTATCG CCGGGCTGGG CCTAGGCGGC GAACTCCCCG TCGTCGCCTC CTACGTATCT GAGCTCTCGC CGCCAGACAG GAGGGGGAGA AACGTAGTTA TTCTAGAGAG CTTCTGGTCT CTCGGCGCGT TGGCGGCAGC CGCCGTGGCG TACTTCCTCT TCCCCCGCCT CGGCTGGAGA ACCGCCCTGC TCCTCCTCGG CCTCACCGCG TTATACGCCG CGGTGATAAG GGCAACGCTC CCCGAGCACA AACCCGCGGC CAAGGGGGCT GTCTCTATTG AGACGAGACG GCTCTACCCA GTGTGGTACA TATGGCTAGT GCTGGCGTTT GGCTACTACG GCGTCTTCCT CTGGCTACCC ACCATCCTCG TCAGAGAGAG GGGACTAGCC GAGGTGCAGA CCTACCAGTT CATGTTAATT ACGACAATTG CTCAGATCCC CGGTTACTTC ACCGCCGCTT ACCTCGTGGA AAAAATCGGG AGGAGACCCA CCGCAGCGAT CTTCTTCCTC GGCTCCGCCG CATCGGCGGC CGCCCTCATA TACAGCGTTA GCTTGCCCCA GCTTTACATC TCTGCCATCG CGCTGAACTT CTTCAACCTA GGCGCCTGGG GCGTGGTATA CGCCTACACG CCCGAGCTTT TCCCAGAACA CGTCAGAGGT TTTGCCACTG GGACCGCCGG CTCTGCCGCA CGGGTGGGGA TGATCCTCGG CCCCTGGCTC TACCCGGCGG CCGGTCTCTA CGCCCTAGTG GCAGTGCCTC TCCTCTGGCT CACCGTCCCC GCCGCCGTAT ATACCCTGCC GGAGACCAAG AGACGCTAG
|
Protein sequence | MISQWKLITI TGIGWLFDAM DVLLLSYILV ASAAELGMGV WERSVVVLAN NLGMLIGATA FGRLSDRLGR RAVFTATLLL YSLATAATAF VKTGWELAAV RLIAGLGLGG ELPVVASYVS ELSPPDRRGR NVVILESFWS LGALAAAAVA YFLFPRLGWR TALLLLGLTA LYAAVIRATL PEHKPAAKGA VSIETRRLYP VWYIWLVLAF GYYGVFLWLP TILVRERGLA EVQTYQFMLI TTIAQIPGYF TAAYLVEKIG RRPTAAIFFL GSAASAAALI YSVSLPQLYI SAIALNFFNL GAWGVVYAYT PELFPEHVRG FATGTAGSAA RVGMILGPWL YPAAGLYALV AVPLLWLTVP AAVYTLPETK RR
|
| |