Gene Pars_2369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2369 
Symbol 
ID5056245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2117561 
End bp2118835 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content50% 
IMG OID640469920 
ProductFAD dependent oxidoreductase 
Protein accessionYP_001154564 
Protein GI145592562 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATACG ACGTCGTAGT AGTCGGAGCT GGCCCGGCGG GCCTCGCGGC GGCGTACAAG 
CTTGCCTCTG CTGGTTTTAA AGTACTTGTC CTGGAAAGAG GTAGAGAGCC AGGCGCTAAG
GAGTTGTACG GCGGACGTAT TTATGCTTAC TGGCTTGATA GATTTCTCCC TGAATTTCGT
AAAGATGCTC CGGTCGATAG GTGGGTTAGG AAAGAAAGAG TGACTTTGCT GACAGAGAAC
AAGGCTCTGA CTGTGGAGTC GGCGGTATTA GAGAAGGAGA GGTCTAGCTT CGTGGTGCCG
TTGGTTTCTT TTGTTTCGTG GATGGCAAAG CTTGCACAAA ACGCAGGTGC GAAGATAGTG
ACTGAGATCA CCGTTGATGC GTTGGTGAGA GATGAAAAAG GCAGATTTGT GGGCATCCAG
TCTGGTTCTG ACATGGTGCA AGCCGACTTT ATAATAGACG CAGAGGGAGT TAACCGCTTG
CTTCTAGAAA GGGCTGGTAT TGTGAAAAAA CTAGAGCCTC ACTACGTCGC CGTGGGAGTT
AAGGAGGTGT TGAAATTTGA AAACAAGAAG GTGCTGGAAG AGAGGCTCGG CCTAGACGAA
GACGAGGGGC TTGCGTGGGC TATTGCCGGC TATCCCACAG AATATCTGCC GGGCGGCGGC
TTCATATATA CGTACAAGGA CTCTCTCGCA CTTGGAGTTG TTGTTTATTT GAAGAACTGG
GAGAAGTTGA AGACTCCGGT ATACGATCTC GTGGAAAAAC TCCGCCTACA CCCCTACATA
GCGTCTCTCG TCAAGGGGGC TACATTACAA GAGTACGGGG GGCACATGAC ACCTGTGGCG
GGCATCAACA TGGCGCCGCC GAGGTTTTAC TATGATGGCC TACTGATAGC AGGAGACGCC
GCAGGCTTCC TCCTCCATAC AGGTGTCCTT ATAAGAGGTG TCGACTTTGC CATAGCTTCG
GGAGTATTGG CCGCGGAGGC TATAAAAGAG ACAAATAGCC CCTCTGCCGA GGATCTCTCT
GTATACGAGA AAAAGCTTAG AACAAGCTTT ATACTGCCTC AGCTTGAAAA GTTTAGAAGC
GCCGACAAGC TACTGGGCGA CGAGGCTCTC TTTAAGGACC TGGCTGTATT TTCCACGGAG
GCGGCGTATA GGTACTTCAA CATTGATGAC AAGCACAGAA CGCTACTAGA GGCGGTACGC
GAGGCGTCGA AGAAGACCGG AATAAGTACA CTAAAGATAA TGATAAATAT GCTAAGAGCG
GTGAGGAGTC TATGA
 
Protein sequence
MKYDVVVVGA GPAGLAAAYK LASAGFKVLV LERGREPGAK ELYGGRIYAY WLDRFLPEFR 
KDAPVDRWVR KERVTLLTEN KALTVESAVL EKERSSFVVP LVSFVSWMAK LAQNAGAKIV
TEITVDALVR DEKGRFVGIQ SGSDMVQADF IIDAEGVNRL LLERAGIVKK LEPHYVAVGV
KEVLKFENKK VLEERLGLDE DEGLAWAIAG YPTEYLPGGG FIYTYKDSLA LGVVVYLKNW
EKLKTPVYDL VEKLRLHPYI ASLVKGATLQ EYGGHMTPVA GINMAPPRFY YDGLLIAGDA
AGFLLHTGVL IRGVDFAIAS GVLAAEAIKE TNSPSAEDLS VYEKKLRTSF ILPQLEKFRS
ADKLLGDEAL FKDLAVFSTE AAYRYFNIDD KHRTLLEAVR EASKKTGIST LKIMINMLRA
VRSL