Gene Pars_1177 details


Gene Information

Locus tag: Pars_1177
Symbol:
ID: 5056154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1067114
End bp: 1068337
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 62%
IMG OID: 640468727
Product: ATPase
Protein accession: YP_001153400
Protein GI: 145591398
COG category:
COG ID:
TIGRFAM ID:
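
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1067114, 1068337)
print(length)                     # 1224
print(protein_length_aa(length))  # 407
```

Both values match the Gene Length and Protein Length fields listed in this record.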


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.0919322
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTACTTCT TCGTAAACCC CAAGAAGCCG CCGCAAGCGT TGCTCCACGA GTACGGAGAG 
GCGTTGAGGA AACACGCCGG GCTCCCCCGC TACGTGAAAT TCGACACCTG GGAGGACTTC
TTCGACGCGT TGTTCGCCCT CAAGGGCGGG GTGGTGGCCT TTGACGAGTT CCAGTGGTTC
GCTGAGGTGG CGCCCGAGGT GCCATATATA CTGCAGAAGA TGTGGGACAC GAGGGGGGAG
AAGCCCTCCG TGATAATAAC GGGGTCTGTG GTGGGGATGG TGAAGAGGCT CGTGGCCGAG
TCAGGTGCCC CCTTATTCGG CAGGGCAGAT CTGGTGATAG AGCTGAAAGA GCTGGAGCCG
GGGGCCGTTT TTAAGTGGCT TGAGGACTTG GGGATCCGTG GCGAGGAGGC CTTTAAGCTT
TACCTGCTAT TCGGCGGCGT GCCCTACTAC TACCGCCTAG TGGAGACGTG GGGCGTCAAG
ACCGTGGAGG AGGCCGTGGA GAAGCTGGTG GTCGGCGTGG GCGCCCCCCT TAGGCACGAG
GTGGAGAACG TCCTGGCCGA GTCGCTGGAC AGGGAGTACA GAACCCACTT GGCCATTCTA
GAGGCCGTCG CGGACGGCTC GACTAAGCTC GAACAAATGG CGGCCCGGGC CGGCGTGAAG
GCCACCTCCC TCCCGCCCTA CCTCCACGAC CTTGTGGACA CTCTGGGGGT CCTGGCCAAG
CATAAGGCGA GGCGGACCTA CTACGAGCTC AGGGATAGAT TCTTCGCCTA TTGGCTGAGG
GCGGTGCATA GGCACAGAGA CACCACCCCC GAGGAGAGGC TGGGAGAAAA GGCGCTGGCG
GAGCTCCCCG GGTTTCTCCA ATGGGCCTTC GAGGCGGCTG TTAGGGAGTT AATCCCCAAG
CTGTATCCAG TTCAGAAGGC AGTGAAGGCT GTCTACTACG CCACGCGGGG CGGCGCTAGA
GTGCAACGCG AGGTCGACGC CCTGGCCGTG AACGAAGAGA GGAAATTCGC GGTTGTGGCG
GAGGCCAAGT GGGCGGAGGT CGAGCCAAGG GAGGTACTCC CCCGGCTGGA GGAGGCGGCG
CGCCATCTAA TCCCGCCCGG GTGGGAGGTG AAATACGCCA TATTCGCCAG GCGCATACGA
GGGGAGGCGC CGGCGGATCT ATACGACGTC GAATCGCTGA TCCAAAGGCT CAAAGCCCGC
GGCGTGGAGC ACTCCCCTAT CTGA
 
Protein sequence
MYFFVNPKKP PQALLHEYGE ALRKHAGLPR YVKFDTWEDF FDALFALKGG VVAFDEFQWF 
AEVAPEVPYI LQKMWDTRGE KPSVIITGSV VGMVKRLVAE SGAPLFGRAD LVIELKELEP
GAVFKWLEDL GIRGEEAFKL YLLFGGVPYY YRLVETWGVK TVEEAVEKLV VGVGAPLRHE
VENVLAESLD REYRTHLAIL EAVADGSTKL EQMAARAGVK ATSLPPYLHD LVDTLGVLAK
HKARRTYYEL RDRFFAYWLR AVHRHRDTTP EERLGEKALA ELPGFLQWAF EAAVRELIPK
LYPVQKAVKA VYYATRGGAR VQREVDALAV NEERKFAVVA EAKWAEVEPR EVLPRLEEAA
RHLIPPGWEV KYAIFARRIR GEAPADLYDV ESLIQRLKAR GVEHSPI
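
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is one way to do that in plain Python, under stated assumptions: it uses the standard amino-acid assignments (which translation table 11 shares for elongation codons), treats only `ATG`, `GTG`, and `TTG` as start codons (a subset of those table 11 allows), and reads a start codon in the first position as methionine. The names `translate_cds` and `gc_percent` are hypothetical helpers, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order, with codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the alternative start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine regardless of its
            # usual assignment.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and final 24 bases of the gene sequence listed above.
print(translate_cds("TTGTACTTCT TCGTAAACCC CAAGAAGCCG"))  # MYFFVNPKKP
print(translate_cds("GGCGTGGAGC ACTCCCCTAT CTGA"))        # GVEHSPI
```

The two fragments reproduce the first ten and last seven residues of the protein sequence shown above. Passing the full gene sequence to `translate_cds` and `gc_percent` would extend the same check to the whole record.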