Gene Pars_0409 details

Gene Information

Locus tag: Pars_0409
Symbol:
ID: 5054842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 356595
End bp: 357992
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 61%
IMG OID: 640467974
Product: phytoene dehydrogenase-related protein
Protein accession: YP_001152661
Protein GI: 145590659
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02734] phytoene desaturase
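
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates (an assumption, though it is consistent with the reported 1398 bp). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(356595, 357992)
print(length)                  # 1398
print(protein_length(length))  # 465
```

Both values agree with the Gene Length and Protein Length fields of this record.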


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGCTA TCGTGATAGG GGGCGGCTTC GGCGGGCTGG CCTCTGCGGC TTTACTCGCA 
AAACGGGGCT ACGAGGTTGT TTTGGTAGAG AAGAACTGCA GACTTGGCGG CAGATCTGTT
CTTTACGACA TCCGTGGCCA CAGGGTCGAG ATAGGTCCGA CGTGGTACCT AATGGACGAC
GTCATAGACA AGGTGTTGGG CGAGATCGGC GGTAGGACGT ACGAGGTGGC TGAGCTTAAC
CCCAGTATGA TGTTCGTGGA CAGGCGCCAC GGCAAGATCG AGGTAGGGAG AGATCTGCCG
CAACGCCTTG AAGAGCTGGA GCAGGGGGCG GGGGATCGTT CTGTTGAGCT CATGCGCGAA
GCTGGCAGGC TGTATCAAGT TGTCGTGGAG CACATGTTGC TGAGAAAGTA CGAGACGTGG
CTCGACATGC TGTCGGCCGC AAAGGCCGGG GCAGGTTTCG CCAAGTACCT CGTGACCAGC
TTCGGAAGCC TTGTGGAGAG GAGGTTTAAA TCGCCTTTAA TACAGCGCCT TTTGGAGTAT
GATATCATGT TTCTAGGGAG CCCCCCGCGC GAGTTGCCCG CGCTCTACGG CCTACTTCTT
AACTACTCCG TCTTCGTGAG AGGTGTTAAG GCGCCTAAGG GCGGCTTCGC CGCTGTTATC
CGTAATTTAA TAGAAGCCGG GGCCCGACTG GGGGTAGACT TCAGGACATG CACAGCGGCG
AGGAGGATTT TGGTGGAGGG CGGCAAAGTG AGGGGCGTGG AGACAGCCAG CGGGGTTTTG
GAGTCGGACG TGGTGGTGAT CAACGCCGAC TACAAGCGAG GCGAGGAGCT TCTGGAGCCC
CGGTACAGAT CTTACGGCGA GGCTTACTGG GGCCGGGTCA AGATGGCGCC GTCTGCGTAC
ATGGCGTTGC TGAGCGGGGA TCGGTGGGAG GGGCCGCCCC ACTTGATATA CATCTCTGAG
TGGGAGCGAC ACCTATCGGC CCTTACCGGC GGCGGGGATA TGCCTCAGCT CCCCTCTTTC
TACCTCCACG TGCCCAGCGT AGTGGAGCCC GACTGGGCCC CACCCGGAAG GTCGAGCATG
TTTATCCTAG TGCCTTCGCC GCCTGGAGTA GACTATTGGC CAAGGGGGCT AGCCGAGAAG
CTAGCGGCGG AGGCCACCGG CGGCTCGGCC GAGACGCTGG CGGAGTTTCC CAGCCGCTTC
TTCTGCGACT ACTACGGCGC CTACCAGTGC ACGGCGCTTG GCCCCAGGCA CACGCTACGC
CAGACCGCCC TGGGCAGGCC TTTAATGAGA GGCCGAATGG TACGTGGGCT GTACTTCGTG
GGGCAGTACA CCCATTCGGG CATCGGCGTG CCATCGGTGC TGGCCTCGGC GTACATCTTG
GCTCGGTACT ATGTCTAG
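
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. As a rough sketch (the helper names are hypothetical), the GC content field could be rechecked like this once the full sequence is pasted in:

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Only the first line of the gene sequence is used here for illustration;
# pass the complete sequence to compare against the GC content field.
first_line = "ATGAGGGCTA TCGTGATAGG GGGCGGCTTC GGCGGGCTGG CCTCTGCGGC TTTACTCGCA"
print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line), 2))
```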
 
Protein sequence
MRAIVIGGGF GGLASAALLA KRGYEVVLVE KNCRLGGRSV LYDIRGHRVE IGPTWYLMDD 
VIDKVLGEIG GRTYEVAELN PSMMFVDRRH GKIEVGRDLP QRLEELEQGA GDRSVELMRE
AGRLYQVVVE HMLLRKYETW LDMLSAAKAG AGFAKYLVTS FGSLVERRFK SPLIQRLLEY
DIMFLGSPPR ELPALYGLLL NYSVFVRGVK APKGGFAAVI RNLIEAGARL GVDFRTCTAA
RRILVEGGKV RGVETASGVL ESDVVVINAD YKRGEELLEP RYRSYGEAYW GRVKMAPSAY
MALLSGDRWE GPPHLIYISE WERHLSALTG GGDMPQLPSF YLHVPSVVEP DWAPPGRSSM
FILVPSPPGV DYWPRGLAEK LAAEATGGSA ETLAEFPSRF FCDYYGAYQC TALGPRHTLR
QTALGRPLMR GRMVRGLYFV GQYTHSGIGV PSVLASAYIL ARYYV