Gene Pars_1709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1709 
Symbol 
ID5054565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1543075 
End bp1544424 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content56% 
IMG OID640469252 
ProductUDP-glucose/GDP-mannose dehydrogenase 
Protein accessionYP_001153912 
Protein GI145591910 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.676622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0907516 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGTCG AGCTTCTCAA GCGTGGTGAG CTCACAGTCG CGGTCTACGG CCTCGGCTAT 
GTCGGGATGG CCCTATCCGC CGCCTGGACG CTGGCTGGGG CTAGGGTCAT AGGCGTCGAC
GTAGATGCGG TAAAAGTAGA GAAGCTGAAC AACGGTGTGG TGGAGTACCC AGAGAGAGAT
GTCGTGGAGG TACTTCTACC AGCAGTGAAA AACGGGAGGT TTACTGCAAC TACTGACGGC
GTCGTGGCGT CAATAAGAAG CCAAGCGAAG ATCGTGGCAG TTCCTGTCTT CCTCAAGAAG
TCGGCTACCT CAGTGGAGGT GGACTTCTCT GCCCTCATCT CCGCCTCAAA GGCCATAGGG
GCTGGCCTTA AAAAAGGCGA CTTAGTGATA ATAGAATCCA GCGTGCCGCC CGGCACCACA
GAGGAGGTCG TTAAGCCTGT GCTAGAAAAC ACCTCCGGCC TTGAGGCGGA GGAGGACTTC
TTCCTCGCCT ACAGCCCCGA ACGCATAATG GTAGGCCACG CCCTCAAGGA CATCGTGGAG
AACTACCCCA AGGTAGTTGC CGGCGTCGGA CCGAAGAGCA CAGAAGAAGC CGCCGGGCTT
TATAGACTAG TGTCCAAAAA AGGCGTAGTG GTGCTGAACA GCGCCAAGGA GGCTGAATTC
GAAAAACTAC TAGAAGGCGT ATACAGAGAC GTCAACATAG CCCTAGCCAA CGAGATGGCG
AAGCTGGCAA ACGCCCTAGG CATATCCTTC AGAAAGGCTA GGGAGGCCGC CAACAGCCAG
CCCTACAGCC ACGTACACAA ACCAGGTTCA GGAGTCGGAG GCAACTGCAT CCCCGTATAC
CCCTACTTCC TCATGTGGGT AGCGGCTAAA TACGGCGTAG ATCTCCCCCT TACGCGCGCA
GCTAGGGCGA TAAACGAGAG GCAACCGTCA GAAGTGGCCT TCGCCGCGGT TAGGGCAATG
CTCAAAAATA GAGTAAACCC AGCAACTGCC AAGATTGCGA TTCTAGGGCT GGCTTTTAGA
GGCGACGTAG ACGACCCCCG CGAAAGCCCC ACATACGGCA TAATCTCCAC TCTACTAAAC
ATCGGAATAA GGCCAGAGCA GATTGTGGTA CACGACCCCT ATATCAAGCA GGATCCCCAG
CTGGCAAAGT GGGGCATCGC CATCTTCCAA GACCTAGAGG CGGCGGTGAA GGGGGCAGAC
GCCGTCGTGG TGTCAACAGA CCACACAGTC TACAGGATAG AGGCAAGTAG AATAGCCAAG
CTCATGAGAA CGCCTCTAAT TGTGGACGCC CGCGGGGTAC TTGTCCCAGA CGTCGAGATA
TACTCAATCG ACGGAGGGCG CTGGCCTTAA
 
Protein sequence
MLVELLKRGE LTVAVYGLGY VGMALSAAWT LAGARVIGVD VDAVKVEKLN NGVVEYPERD 
VVEVLLPAVK NGRFTATTDG VVASIRSQAK IVAVPVFLKK SATSVEVDFS ALISASKAIG
AGLKKGDLVI IESSVPPGTT EEVVKPVLEN TSGLEAEEDF FLAYSPERIM VGHALKDIVE
NYPKVVAGVG PKSTEEAAGL YRLVSKKGVV VLNSAKEAEF EKLLEGVYRD VNIALANEMA
KLANALGISF RKAREAANSQ PYSHVHKPGS GVGGNCIPVY PYFLMWVAAK YGVDLPLTRA
ARAINERQPS EVAFAAVRAM LKNRVNPATA KIAILGLAFR GDVDDPRESP TYGIISTLLN
IGIRPEQIVV HDPYIKQDPQ LAKWGIAIFQ DLEAAVKGAD AVVVSTDHTV YRIEASRIAK
LMRTPLIVDA RGVLVPDVEI YSIDGGRWP