Gene Pars_0213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0213 
Symbol 
ID5056368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp192405 
End bp193544 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content59% 
IMG OID640467792 
Productputative ABC-2 type transport system permease protein 
Protein accessionYP_001152480 
Protein GI145590478 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1668] ABC-type Na+ efflux pump, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.331802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.747646 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTGCAT TCACGGCACT GGTGGTGAAG GACCTCCAGG AGATTTTGAC ATCTCGCTAC 
TTCTTGGCTT CTCTTCTAGG CGGTTTTGTA GTCTTAATAG CGCTGGGCGC CGTGATGGGG
GCGTCGATAG AAACTGCCCA GAAGGCTACT CAGAGTTTTG CAGTGGTGGT GGGGAACACC
ACAGAGCTGG GCCAGAGGTA CGTGGAGCTG TTAAGGGAGC TCGGCGGTGT TTTGTACGAG
AAGTTCTCCC CCGACTTGCT TGACAGGTAC AGCTACGCCG TCGTGGTGCC GCCTAATTTC
ACACTGCCGG CAAAGGTGGA GGTGTATGCA AAATACAGGG GGTTGTTGTC CACGGCGACG
CCTCTCTTCG TGGAGCTGGC CGCGCAGAGG CTTGCCGAAG AGGTGGGCGT GCCGCCGCAG
CCTATCAACA CGGAGCTCTA CGTATACCTC GGCGATCGCG TGTTAAAGGC TGGGGAGGTG
GCGATGCTGG CAAATCTCTT CCTCATATCT TGGATGTTTA TGTTCCTGGT GCCTCTGCTA
GTCGCCTCCA CGGCGGCGGT TGCGGTGGGG CTTGAGAAGG AGAAGAGGAC GTTTGAGCTT
ATCCTCTCGA CGCCGGCTAC TGCGAGGACG CTCGTCGCGG CGAAGCTCAC AAGCGCCGTG
GCGCTGGCGT TTATACAATT CGCCGTGATG GCCGTGGCGT TTATCTTCTA CTTCTACAAC
CTTTCCAGAG CCGCACCCCC CGTGCTTTCC GGCGAAGTTG CGGGGGAGGC AGTAGCTCCG
TCGCCCGCGT TGTTTGTCCC GGTGGCTTTG TCCACGCTGG CCTTGTCCTT GGCGCTACTG
GGACTCGCGT TTATAGCCGC GACCAGAGTC GAGGATATAA AGACGGCGCA GAGCGTCGTA
CCCATGGTTG TGTTCCCCCT CCTCGTGCCG TCCTTCGCCG CGATCTTCGG CACAGTGGAA
GGCCTTGAGG CCTACCCCTT CGTCCACCCA CTGGCTGTGG CATATTCGGC GCTGGTTGGG
CAGTGGGATA AGGCCTACGC CTTTCTCGCA ACTGATTGGG CCTTGGCCAT CGCCGTCGTC
GCATCTATAC TCAAATTCGT CACCACGGAC TACCTCATAA CTGGGAGGTG GAGGCGATGA
 
Protein sequence
MGAFTALVVK DLQEILTSRY FLASLLGGFV VLIALGAVMG ASIETAQKAT QSFAVVVGNT 
TELGQRYVEL LRELGGVLYE KFSPDLLDRY SYAVVVPPNF TLPAKVEVYA KYRGLLSTAT
PLFVELAAQR LAEEVGVPPQ PINTELYVYL GDRVLKAGEV AMLANLFLIS WMFMFLVPLL
VASTAAVAVG LEKEKRTFEL ILSTPATART LVAAKLTSAV ALAFIQFAVM AVAFIFYFYN
LSRAAPPVLS GEVAGEAVAP SPALFVPVAL STLALSLALL GLAFIAATRV EDIKTAQSVV
PMVVFPLLVP SFAAIFGTVE GLEAYPFVHP LAVAYSALVG QWDKAYAFLA TDWALAIAVV
ASILKFVTTD YLITGRWRR