Gene Pars_1175 details


Gene Information

Locus tag: Pars_1175
Symbol:
ID: 5055876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1063651
End bp: 1065060
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 59%
IMG OID: 640468725
Product: ABC transporter related
Protein accession: YP_001153398
Protein GI: 145591396
COG category: [R] General function prediction only
COG ID: [COG3845] ABC-type uncharacterized transport systems, ATPase components
TIGRFAM ID:
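
The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1063651
end_bp = 1065060


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1410
print(protein_length(length_bp))  # 469
```

Under these assumptions both results agree with the listed gene length (1410 bp) and protein length (469 aa).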


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.902135
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0612895
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACAAC AAGCCGAACC CGCGCTTAGG GCTGTCGGTA TTACAAAGTT TTTTCCCGGC 
GTCGTAGCCC TAGACGGGGT AGACCTCGAG GTAAATAAAG GTGAAGTCCA CAGCCTCTTA
GGCGAGAACG GGGCCGGGAA GTCCACCCTC ATCTCAATAC TCTACGGCGT ATATGCCCCC
GACCGCGGCG AGATATATTT AAGGGGGAGG AGGATCACAA TCCACTCCCC CCGGCACGCC
GCCAAGCTGG GTATATCACT GATATCCCAA CACTTCGCCC TGGTGGAGAG CCTCACCGTA
GAGGAGAACC TCAGACTAGC CGGCATAGAC GTGGAAAAAG CCGAAAAAAT CGCTGAACAG
CTCGGCGTAG AGATAGACAT GCGGCGCTAC GTGGCGGAGC TCACCGTGGG CCAGCGGCAG
AGGCTAGAGA TAGTGAAGTC CCTGGCGAGG GAAAGCGATG TCCTCCTCAT GGACGAGCCG
ACGGCTCTTC TCAGCCCCAG GGAGGTGAAG TCGCTTTTGG CAGTGGTGAG GCGGCTTGCG
GAGATGGGAA AGGCTGTGGT ATTCGTCACA CACAAGATCA GGGAGGCGGT GGAGGTGTCC
GACAGAATCA CCGTGCTGAG GCGGGGGAAG AAAATAGCCA CATACGAGAG GCCGTTTGAC
GAAGACGCCC TCCTCGCCGC CATGTTCGAG GCCTCTGTGA AGCGCCGGGT CCACAAAGCG
AGCCGCGCCA CAAGCGAGGT GATATACCGC GCTGAGGGCC TCAAGGGGGG CAGAGTAAGG
GAAGCCACCA TAGAGCTGAG GAGGGGGGAG GTTGTCGCGG TTCTCGGCGT TGCGGGAAAC
GGCCAAGAAG AGCTGATGGA GCTACTAGCC GGCTTCAAAA AGCCCACAGG CGGCAGGATA
TTCCTCGACG GCCAGGAGCT TACGGGGCGG CCCTTTGCCG AGTTTTTAAA AAAAGGCGTC
GCCTACGTCC CCGAGGATAG GTGGCGCGCA TTGGCTAAGG AGCTGTCCGT ACTCGACAAC
TTTAAGATTA GGTGTGTTAA AAACTGCGTG GAGGCGCTCC AAGAGGCCAA GAATGAGCTG
GACATAGACT TCCCGACGCC AAGCGCCAGG GTGGCCGCCC TCTCCGGAGG CAACCAGCAG
AAGGTAATCC TAGCCAGGGA GGTGTGGCTC AGAAAACCCT ACGTCTTGAT AGCGTCTTAC
CCCACCAGAG GCCTTGACGC CGATACTTCG GAGAAGTTCT ACGCCCTGGT GAGAAGCGCT
GTGACCGGCG GCGCCGTGGT CAGCCTAGAA GACGTGGAGG AGGCCGTGGA GAAGGCTGAC
AGGATATACG TCATGTCCAG AGGCCGCATC GTTGCGTCCT TCGAGCCCCC CTTCGACATC
GGCGAAATAG CCGAGGCGAT GACACAATGA
 
Protein sequence
MPQQAEPALR AVGITKFFPG VVALDGVDLE VNKGEVHSLL GENGAGKSTL ISILYGVYAP 
DRGEIYLRGR RITIHSPRHA AKLGISLISQ HFALVESLTV EENLRLAGID VEKAEKIAEQ
LGVEIDMRRY VAELTVGQRQ RLEIVKSLAR ESDVLLMDEP TALLSPREVK SLLAVVRRLA
EMGKAVVFVT HKIREAVEVS DRITVLRRGK KIATYERPFD EDALLAAMFE ASVKRRVHKA
SRATSEVIYR AEGLKGGRVR EATIELRRGE VVAVLGVAGN GQEELMELLA GFKKPTGGRI
FLDGQELTGR PFAEFLKKGV AYVPEDRWRA LAKELSVLDN FKIRCVKNCV EALQEAKNEL
DIDFPTPSAR VAALSGGNQQ KVILAREVWL RKPYVLIASY PTRGLDADTS EKFYALVRSA
VTGGAVVSLE DVEEAVEKAD RIYVMSRGRI VASFEPPFDI GEIAEAMTQ
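
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction from the cleaned nucleotide string; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text):
    """Remove all whitespace (spaces, newlines) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence, as printed in this record.
fragment = clean_sequence("ATGCCACAAC AAGCCGAACC")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.55
```

Applying the same two calls to the full gene sequence gives a value that can be compared with the GC content field of the record.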