Gene Pars_1218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1218 
Symbol 
ID5055212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1102319 
End bp1103458 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content57% 
IMG OID640468765 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_001153438 
Protein GI145591436 
COG category[R] General function prediction only 
COG ID[COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.155112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAAA AAGTGGTTAT AGTAGGCGGC GGCGTGGGAG GATCTTTTGT GGCAAACAAG 
CTCGCCTACA AGCTCAGCTC AGAGGTGAGG AAAGGCGAGG TGGAGATAAC TGTCATAGAG
CCGGCCCAGG TATTACACTA TCAGCCAGGG TATCTCTACG TCCCGTTTAT GGAGCTTCCG
TCTGACGTAA TGTTCAGAAG CCCCAAGAAG GTGCTGAGCC CGTTGGTTAA GCTCGTGGAG
AAGCCCGCCG CGAAGATTGA CCTCAAGGCC AAGAAGGTGC AGACGGCAGA CGGCGCCGAG
TATCCATACG ACTACTTAGT AGTGGCCTCG GGGGCCGTGG CGAGGACCGA CGCGATTCCC
GGCTTCAACA AGACGTGGTA CACCCTGTGG ACTTACGAAG GAGCGAAGGC GCTTAGGGAG
AGGCTGAGGT CGTTTACTAG GGGCACTTTG GTCTTCAGTG TGACGTCCAC GCCGTATAAG
TGCCCCGTGG CGCCGTACGA GTTCCTGTTC ATGTTTGACG ACTACCTCAT GTCCACTGGG
CTTAAGAAGG ATGTGAAGCT GATCTTCACA ACCGTCGCGC CGCACCTCCA CGCACAGCCC
AACGTGAACA AGTTCTTGGA GGAGCAGATG AAGATGAGGG GTATTGAGTA CAGGACGAAG
TTCGAGGTGA AGGAGATCAA GGAGGGCGAG GTGGTTGGCC CAGAGACCAT AAAGGCCGAC
CTAGTGGTCG CGGTCTCCAA GCACACGCCC GCCGACGTGG TGGTTAATTC GGGGCTAGTG
GATCAAAGCG GCTGGCTTCC CGTGGATAAG AGCACATTGC AGATACAAGG CGGCTCAGGT
GTGGAATATG CCATTGGCGA TACCACAAAC CTCGCTGTGC CGAAGGCCGG TTCCGTGGCC
CACTTCCAGT CGGAGGTCGT GGCGTCGCGG ATACACGAGG AGATTACCTT GGGCCACGCC
GACACGGTGT ACAGAGGTCG GGTGATCTGT TTTATAATGA CCGGTTTTGA GGAGGCTACC
CAGGTGTCGT GGAATTACGA AAACCCGGCG CTGTACCCGC CGCCCAGTAG CAAGTTCTTC
GCGAGGCTTA AGGACCTCAC CAACTACTCG ATATGGGGAG TCATGAGGTG CGGCCTATGA
 
Protein sequence
MPKKVVIVGG GVGGSFVANK LAYKLSSEVR KGEVEITVIE PAQVLHYQPG YLYVPFMELP 
SDVMFRSPKK VLSPLVKLVE KPAAKIDLKA KKVQTADGAE YPYDYLVVAS GAVARTDAIP
GFNKTWYTLW TYEGAKALRE RLRSFTRGTL VFSVTSTPYK CPVAPYEFLF MFDDYLMSTG
LKKDVKLIFT TVAPHLHAQP NVNKFLEEQM KMRGIEYRTK FEVKEIKEGE VVGPETIKAD
LVVAVSKHTP ADVVVNSGLV DQSGWLPVDK STLQIQGGSG VEYAIGDTTN LAVPKAGSVA
HFQSEVVASR IHEEITLGHA DTVYRGRVIC FIMTGFEEAT QVSWNYENPA LYPPPSSKFF
ARLKDLTNYS IWGVMRCGL