Gene Pisl_1972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1972 
Symbol 
ID4617903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1788513 
End bp1789631 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content61% 
IMG OID639785063 
Productmajor facilitator transporter 
Protein accessionYP_931462 
Protein GI119873455 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.106436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.000000141049 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTTCCC AATGGAAGTT GATAACTATA ACAGGCATAG GCTGGCTCTT CGACGCCATG 
GACGTCCTCC TCTTGTCGTA CATACTAGTG GCGTCGGCGG CTGAGCTAGG GATGGGGGTC
TGGGAGAGAT CCGTGGTGGT ACTAGCGAAC AACCTCGGCA TGTTAATCGG CGCAACTGCC
TTTGGGAGAC TGTCAGACAG GCTTGGCAGA AGGGCCGTCT TCACAGCGAC GCTCCTCCTC
TACAGCCTTG CCACGGCTGC CACAGCCTTT GTAAAAACCG GATGGGAGCT TGCAGCTGTG
CGTCTTATCG CCGGGCTGGG CCTAGGCGGC GAACTCCCCG TCGTCGCCTC CTACGTATCT
GAGCTCTCGC CGCCAGACAG GAGGGGGAGA AACGTAGTTA TTCTAGAGAG CTTCTGGTCT
CTCGGCGCGT TGGCGGCAGC CGCCGTGGCG TACTTCCTCT TCCCCCGCCT CGGCTGGAGA
ACCGCCCTGC TCCTCCTCGG CCTCACCGCG TTATACGCCG CGGTGATAAG GGCAACGCTC
CCCGAGCACA AACCCGCGGC CAAGGGGGCT GTCTCTATTG AGACGAGACG GCTCTACCCA
GTGTGGTACA TATGGCTAGT GCTGGCGTTT GGCTACTACG GCGTCTTCCT CTGGCTACCC
ACCATCCTCG TCAGAGAGAG GGGACTAGCC GAGGTGCAGA CCTACCAGTT CATGTTAATT
ACGACAATTG CTCAGATCCC CGGTTACTTC ACCGCCGCTT ACCTCGTGGA AAAAATCGGG
AGGAGACCCA CCGCAGCGAT CTTCTTCCTC GGCTCCGCCG CATCGGCGGC CGCCCTCATA
TACAGCGTTA GCTTGCCCCA GCTTTACATC TCTGCCATCG CGCTGAACTT CTTCAACCTA
GGCGCCTGGG GCGTGGTATA CGCCTACACG CCCGAGCTTT TCCCAGAACA CGTCAGAGGT
TTTGCCACTG GGACCGCCGG CTCTGCCGCA CGGGTGGGGA TGATCCTCGG CCCCTGGCTC
TACCCGGCGG CCGGTCTCTA CGCCCTAGTG GCAGTGCCTC TCCTCTGGCT CACCGTCCCC
GCCGCCGTAT ATACCCTGCC GGAGACCAAG AGACGCTAG
 
Protein sequence
MISQWKLITI TGIGWLFDAM DVLLLSYILV ASAAELGMGV WERSVVVLAN NLGMLIGATA 
FGRLSDRLGR RAVFTATLLL YSLATAATAF VKTGWELAAV RLIAGLGLGG ELPVVASYVS
ELSPPDRRGR NVVILESFWS LGALAAAAVA YFLFPRLGWR TALLLLGLTA LYAAVIRATL
PEHKPAAKGA VSIETRRLYP VWYIWLVLAF GYYGVFLWLP TILVRERGLA EVQTYQFMLI
TTIAQIPGYF TAAYLVEKIG RRPTAAIFFL GSAASAAALI YSVSLPQLYI SAIALNFFNL
GAWGVVYAYT PELFPEHVRG FATGTAGSAA RVGMILGPWL YPAAGLYALV AVPLLWLTVP
AAVYTLPETK RR