Gene Pars_1069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1069 
Symbol 
ID5055376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp952458 
End bp953558 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content62% 
IMG OID640468625 
Producthypothetical protein 
Protein accessionYP_001153299 
Protein GI145591297 
COG category[S] Function unknown 
COG ID[COG1602] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTGCG GTCTGCCGAG GTGCCCCATC GAGGAGAGGA TTAGGGCCGT GAAGTCCTCC 
CTCCTCAAGA TCCGGGGCCG GGAGGTCTTC GGCGCCACGC CCCCCAGCGC AGTTGTCGGC
GAGGCTGGGT GGCCGCGGGT GAGAGTCTAC ATCGGTGAGC CCCCCGAGGT GACAGGCGAG
GAGGCCAGAG CTTACGACGA CCCGCGGCTG CTCTGGGGGA GGGAGCTGGA GGAGATTCTA
AGGCTCAGGA GCTACATGGT ATTCGGCTAC GCGGCCCAGA CAAGCCCGCG GAAGCTCGGC
GAGTTGCCCC TCCTCGCCGT CTCCGAGAGG CCGGTGGACG TGGAGATGCG CCTAGCCAAA
ACCCCTGTGG AGAGCCTCAA GTTCGACCTA AGGGAAAAGC CCATGGGCCC CAGGGCGCCT
CTCGAGGCCT TGAGGATAGA CGGCAACCCG GCGGTGCCGC GGGCGTTGGA CAAGCTGATG
TCCGACGATC TGGGCGCCGG GGCTGCCGCC GTTGAGCTGT ACAGAAGGGG TGTCGACCTC
TACACTATCC AGAGGGCCTT CGCCCTAGGC CTACTCGGGG CGAGACACAG GCGGAGGCTC
GTCCCGACGC GGTGGAGCAT AACCGCAGTC GACGTGGCTA TCGGCGATGC CCTGGCGCAA
CAGGTTAGGC ATATGCCGGA GGTTTCACAA CCCCTATACG GATACGCCGA GTATCTAGAC
AACCGCTACC TGGTCGCCGT GGTCCCCGGC CCGCTTAGGT TCTACTACCT GGAGAGGTGG
ACATATGCAG GAAGAGTCGC CGAGATAGAG GTGGCGGAGG ACCCCCGGGG AGTGCGAAGC
ACCATGGACG GCGGCTACGA AGCCGCCAGG CTGGCGATAC TGGAGAAGCT GGCCTCAATG
GGCAGACGAG GCACTGTGTC AATAGTGAGG TGGATAGGCG AGAGGTACTA CGTCTCGGTG
GGCAACTGGC AGATAAGAGA AACCCTACGC AGACTCCAGC TGAAGCCGCT AGACGAAAAC
TACAAGACAT ACACCGCGCT GGTAGGGAAA GACCCGATCT CACTCATAAA AAATACTAAG
AGACTAGACG AGTTTCTCTA A
 
Protein sequence
MLCGLPRCPI EERIRAVKSS LLKIRGREVF GATPPSAVVG EAGWPRVRVY IGEPPEVTGE 
EARAYDDPRL LWGRELEEIL RLRSYMVFGY AAQTSPRKLG ELPLLAVSER PVDVEMRLAK
TPVESLKFDL REKPMGPRAP LEALRIDGNP AVPRALDKLM SDDLGAGAAA VELYRRGVDL
YTIQRAFALG LLGARHRRRL VPTRWSITAV DVAIGDALAQ QVRHMPEVSQ PLYGYAEYLD
NRYLVAVVPG PLRFYYLERW TYAGRVAEIE VAEDPRGVRS TMDGGYEAAR LAILEKLASM
GRRGTVSIVR WIGERYYVSV GNWQIRETLR RLQLKPLDEN YKTYTALVGK DPISLIKNTK
RLDEFL