Gene Pars_0485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0485 
Symbol 
ID5055606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp430226 
End bp431374 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content57% 
IMG OID640468049 
Producthypothetical protein 
Protein accessionYP_001152734 
Protein GI145590732 
COG category[C] Energy production and conversion 
COG ID[COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain 
TIGRFAM ID[TIGR00273] iron-sulfur cluster-binding protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.964373 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTGGG AAGAAGCAGT AGAGAGAGCG AGGTTGCATA TTATACCAAG GACGTACGAC 
GTTCTTTCGC GCTTTGGCTA TATCACGAGT TTGGCTAAAG AGGTGAGGAA AGTAAAAGAA
GAGGTTATTA GAAACCTTGA TAATTACATA GAAGAGACCA GAAAGGCTGT AGAGAGGATA
GGGGGCAGGT TTATCTTGGC GTCAACCGCC ACAGAGGCAG TAGAGTCGGC GGTTAAAATT
GTGGGGCAAG GCAAGGTGGT TGTCATGAGT AAAAACAACG TGGCGACAGA GACAGGTCTG
CGCCAAGGTC TAGAAAGGGC TGGAAACGAA GTGTGGGAGA CGGATTTGGG GGAGTTCCTG
GTACAATTAG CTAATGACGA GCCTAGCCAC ATATTAGCAC CGGCTGTCCA CATGACTAAG
GAGAGGGCTG CAGAGGTCTT GGCAAAGAGG CTAGGAATGG CGGTGCCGCC TGAGCCCGAG
GCCATTGCGC AGAGGGCGAG GGAGTTTCTC CGCAACAAGT TCATAAAGGC AGACGTCGGG
ATCACGGGGG CCAACGCCAT CGCCGCAGAT ACGGGGGCTG TGGTGCTGGT GGAGAACGAG
GGCAACATAA GGCTGACGTC TGGCCTCCCC CCAGTGCACA TAGTCTACGA CGGCGTGGAG
AAAATCGTGC CGACACTGGT AGACGCCATG GCCGCGGCGG CGGTGCAGTC CGCCTACGCC
GGCCTCTACC CCCCCACCTA TATAAACATC TCCGCCGGCC CCAGCTCCAC GGCAGACGTG
GAGATGCACA GAGTTTCACC CGCCCAAGGG CCAAAGGAGT TTTACATGAT CTTGGTAGAC
AACGGCCGCA GAGCCGTTGC GAGGGATCCG GTGTTGTGGG AGGCACTCCT CTGCATACGG
TGCGGCCGTT GCCACCTCCA CTGCCCAGTC TACCGCGCCT TGGGGAGGGA GTTCGGCGTG
CCGCCCTACA CCGGCCCCAT GGGCGTGATG TGGACCGCCG TGACGAGAGG CATAGAGGAG
GCCGGCCCCC ATGCGCTCAA GTGCGTCCAC GCGGGCAACT GCAAAGAGGT ATGCCCAATG
GGCATAGACA TCCCCGGGGT GATACACGAG GTGAAGAAAA GGTACCTATC TCCAACTGGG
TCCAAGTAA
 
Protein sequence
MSWEEAVERA RLHIIPRTYD VLSRFGYITS LAKEVRKVKE EVIRNLDNYI EETRKAVERI 
GGRFILASTA TEAVESAVKI VGQGKVVVMS KNNVATETGL RQGLERAGNE VWETDLGEFL
VQLANDEPSH ILAPAVHMTK ERAAEVLAKR LGMAVPPEPE AIAQRAREFL RNKFIKADVG
ITGANAIAAD TGAVVLVENE GNIRLTSGLP PVHIVYDGVE KIVPTLVDAM AAAAVQSAYA
GLYPPTYINI SAGPSSTADV EMHRVSPAQG PKEFYMILVD NGRRAVARDP VLWEALLCIR
CGRCHLHCPV YRALGREFGV PPYTGPMGVM WTAVTRGIEE AGPHALKCVH AGNCKEVCPM
GIDIPGVIHE VKKRYLSPTG SK