Gene Pars_1524 details


Gene Information

Locus tag: Pars_1524
Symbol:
ID: 5054083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1383374
End bp: 1384438
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 60%
IMG OID: 640469065
Product: major facilitator transporter
Protein accession: YP_001153730
Protein GI: 145591728
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
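
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1383374, 1384438)
length_aa = protein_length(length_bp)
print(length_bp)  # 1065
print(length_aa)  # 354
```

Both results agree with the Gene Length (1065 bp) and Protein Length (354 aa) fields of this record.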


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.38221
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATAAACC TAGCAACTCT TCTTTTTTTC ACAGCCAACG GAATCGCCGT AGTGGCAATA 
CCGCCGTATT TAAGAGACCT CGGCGTGAGG AGCGAGTCGG TAATAGGCGC CATTGTGTCA
ACGGCGTTTT TCGTGTCAAT AATAATGCGG CCCGTCAGCG GGGTGCTAGG CGACAGGATA
GGCTACATAA CCCTCATGAG GGCAGGAGTG GCCTCGGCGG TGGCCGCCCA GGCCATGTAC
CTAGTGGGCG ACCCGTTTTG GGTACAAGTG GGGAGGCTAT TCCACGGCCT CGCAATAGCC
ACCTTCCTCC CAATGTCAGT AGCCGCCTCA GTTGCCGAGG GCCCCAAGGC GATGGCCGCC
CGGTCTCTGG CAGTGGGCGT GGGCAACGTC CTTGGCCCCC TCCTCGGTAG CGCTTTATAC
GACATAGGAG GCGCGCGCCT CTCCTTCATC ACAGCCCTCG GGCTCCACGC CTCCAACTTT
GCCCTGGTAA GAGGCGGCGA CAAGACGCGT AGCCCCGGAG AGCCGGGCAC GGGCATAGAG
AGGCGGGTAT TCCTATTCAT GGCACTACTA TCGCTCTACG GCGCCGCTTA TATGGGCATC
TCCACCTTCA TCCCAGTAAA ACTCAGAGAC AACAACCTCC CCATAGCCTA CTGGGGCCTC
TTCTCATCCT CCGCCGCCTT GGTGAGCCTC TTGCCTAGGG CTTTCCTATT GAAGAAAGGC
CTCGTGACGC CAACAACCGC CGGAGCCGCC ACGGCGGTTG CGGCCCTGGG GATGGCCGCA
GCGACCTTTG CAGATGGGCC ACTCCTCTTC GTAGCCGCCG GGGCCATATA CGGCCTGGGA
CAAGGCGCCG TGGTTGTCAC ATACCAGATA CTGGCACTAG CCGGGAGCAA GAGGGCGGGG
GTAAGCAGCT CTGTGTACAC AATGGGCTGG GACGTCGGAT CCATAATAGG CCCCGTCCTC
GGCGGCTGGC TCGTGGAGAA CTTCGGCCTA GCCGCGTTGC ACTACACCCC CCTCCTCCTG
GCGGCGAACG TCGCAGTGCT GTTTTTATAC GCAAGACGTA AGTAA
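
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of that cleanup together with a GC fraction calculation. The helper names are hypothetical, and the example runs only on the first ten-base block, so it does not reproduce the 60% figure reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First ten-base block of the gene sequence above.
first_block = "TTGATAAACC"
print(gc_fraction(first_block))  # 0.3

# Block formatting does not affect the result.
print(clean_sequence("TTGATAAACC TAGCAACTCT"))  # TTGATAAACCTAGCAACTCT
```

Passing the full 1065 bp sequence to gc_fraction would give the gene-wide value to compare with the GC content field.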
 
Protein sequence
MINLATLLFF TANGIAVVAI PPYLRDLGVR SESVIGAIVS TAFFVSIIMR PVSGVLGDRI 
GYITLMRAGV ASAVAAQAMY LVGDPFWVQV GRLFHGLAIA TFLPMSVAAS VAEGPKAMAA
RSLAVGVGNV LGPLLGSALY DIGGARLSFI TALGLHASNF ALVRGGDKTR SPGEPGTGIE
RRVFLFMALL SLYGAAYMGI STFIPVKLRD NNLPIAYWGL FSSSAALVSL LPRAFLLKKG
LVTPTTAGAA TAVAALGMAA ATFADGPLLF VAAGAIYGLG QGAVVVTYQI LALAGSKRAG
VSSSVYTMGW DVGSIIGPVL GGWLVENFGL AALHYTPLLL AANVAVLFLY ARRK
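
The gene sequence begins with TTG while the protein begins with M. Under translation table 11, TTG is an accepted alternative start codon, and an initiator codon is read as methionine. The following is a minimal sketch of that rule. It assumes a deliberately partial codon table that covers only the first six codons of this gene, and the names used are illustrative rather than part of any library.

```python
# Partial codon table: only the codons needed for this illustration.
PARTIAL_CODON_TABLE = {
    "TTG": "L",  # leucine when internal; read as M when used as a start codon
    "ATA": "I",
    "AAC": "N",
    "CTA": "L",
    "GCA": "A",
    "ACT": "T",
}

# A subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(raw: str) -> str:
    """Translate a short CDS prefix, reading a valid start codon as M."""
    seq = "".join(raw.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(PARTIAL_CODON_TABLE[codon])
    return "".join(residues)


# First 18 bases of the gene sequence above, in their original block layout.
print(translate_prefix("TTGATAAACC TAGCAACT"))  # MINLAT
```

The output matches the first six residues of the protein sequence listed above. A complete translation would need the full 64-codon table.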