Gene Pars_1766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1766 
Symbol 
ID5055351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1586746 
End bp1587840 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content58% 
IMG OID640469311 
Productmajor facilitator transporter 
Protein accessionYP_001153969 
Protein GI145591967 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.000930556 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATATAA GGCTTATAAT AATGTTGGGG CTCGTCTCAC TTTTTGCCGA TTGGTTATAC 
GAAAGCATGC GCGCCGTGGC TCCGCAATAT CTATACATGC TGGGCGCAAC AGCGGTGTTT
GTGGGCTTCG TTTTCGGGCT AGGCGACGCT TTGGGCTACG CCGCGCGTGT AGTGACGGGT
CCTCTGGCCG ACAGGAGAGG CGGCTACTGG CTGGAGACTT TTCTCGGCTA TGGCCTACAG
ATAGCCGCCG TCGGCGGCTT GATATTCGCA AAGGATCTAT GGCAAGCCGC CGGGCTGATT
TTCTTGGAGA GGTTTGCCAA AGCTTTGAGG ACACCCGCGC GTGATGTGCT CATATCGGCC
GCGGGAGGCG GCAAGGCGAA GGGCAGGGCC TTCGGCATCC ACGCGGCTCT GGATCAGATA
GGGGCTATTA TCGGCGCCGC TATGGCTACG GCGATGTTGT ATATGTACTA CACGCCAAGG
GACGTCTTTG CAACGGCTTT GCTTCCCGGT GCCGTCGCAC TGGCTCTACT CTACGCGGCG
TATAGGCTAA GCGGTGTGAG GCCGTCCGGC AGAGGCCGTG TCGGCGGGGG ATGGAGGGCG
GCTACGGCCT TTGCGGCTAC GCAGTTTTTC CTCGGCCTCT CCCTAACACA CATCTCGCTG
TTTCAGTACA GGCTAGCCGA GGTTCCTTGG CTCGCCTCGT TGCTGTTCCT AATAGCTATG
ATCGCCGAGG TGCCTGCCTC ATTGCTGTTG GGTTTTCTCC ACGACAAATC GTCTAAGGCG
CTTCTCATAG GGCCCGTATT CACCGTGTTG CTCGCGCTGT CGTTCATGGC GGGTGGGCAT
TACTTGTTCT TGGGCGCAGC GCTGTACGCA GTAGCTACTT CCTATGCCGA TGTGGTGGCG
AAGGCCTACG CGGCGAAGCT AGGCGCTGCT GCCTCGTTAG GTCTCGTCAA CGCGATGTGG
GGACTAGGGC TGTTAGCTGG CGGGGTAGTC TACGGCTTTT TAACAGACAT GGGGATTTAC
TGGGCAATCG GGGCACTAGC CTCCGCCGCC TCGTTGGCCT CTTTCTACAT GCTATGGAGA
TTGACCACGT ACTAG
 
Protein sequence
MNIRLIIMLG LVSLFADWLY ESMRAVAPQY LYMLGATAVF VGFVFGLGDA LGYAARVVTG 
PLADRRGGYW LETFLGYGLQ IAAVGGLIFA KDLWQAAGLI FLERFAKALR TPARDVLISA
AGGGKAKGRA FGIHAALDQI GAIIGAAMAT AMLYMYYTPR DVFATALLPG AVALALLYAA
YRLSGVRPSG RGRVGGGWRA ATAFAATQFF LGLSLTHISL FQYRLAEVPW LASLLFLIAM
IAEVPASLLL GFLHDKSSKA LLIGPVFTVL LALSFMAGGH YLFLGAALYA VATSYADVVA
KAYAAKLGAA ASLGLVNAMW GLGLLAGGVV YGFLTDMGIY WAIGALASAA SLASFYMLWR
LTTY