Gene Pars_1401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1401 
Symbol 
ID5056367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1263622 
End bp1265145 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content61% 
IMG OID640468944 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_001153613 
Protein GI145591611 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.324731 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.9044 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTAA TAACATCACT TGAGATGTAC GTCGCTGATA GAAACGCCGA GTGGCTCGGC 
GTGCCCCGGC TGGTTCTCAT GGAAAACGCG GGGGCCGCTG TGGCGCGTAA TATTTTGAAG
AAGTATCCCC ACGCTTCTAG GGTGTTGGCT ATATGCGGGA CGGGAGATAA CGGGGGGGAC
GGCTACGTGG CTGTGAGGCA CCTCCACGCC GCTGGGAAGG AGGTGCGGGT GATCGCGCTG
GGCGAGCCAA GGGAGGAGCT AGCGGCGAGG AACTACCATG CTGTTAGGAG GCTGTGGGGG
GTCGAGGTTG CTGTGGTTCA GTCCCCTCTT GAGCTCTTGG CGTTGCAAGA CTGGCTTATG
TGGGCAGATG TTATAATAGA CGCGGTCCTA GGCACGGGGA TTAGGGGCGC ATTGAGGGAG
CCGCACGCAA CGGCGATTGA GCTCATGAAC ATCGCCCCGG CGCCTAAGGT GGCGGTGGAT
ATCCCAAGCG GCTTAGACCC CGACACGGGC GAGGTGAGAG ACAAGGCAGT GAAGGCGGCT
CTCACCGTGA CTTTCCACAA GGCGAAGAAG GGACTCCTCG CCCCCAGCGC GGCGCGGTAC
GTGGGGGAGC TGGTGGTGGA GCCGATTGGC ATTCCGCCTG AGGCAGAGGT CATAGTCGGC
CCCGGCGACT TTGCCTACCT GAACTTCTCC CGGAGAGCCG ACTCGAAAAA GGGCGACCAC
GGTCGGGTTC TAGTGGTGGG AGGCTCCTTG GAGTACTCCG GCGCTCCGGT ATTTGTGGCT
AAAGCCGCCT TGAGGGCTGG GGTGGATCTC GCAGTGATCG CCGCGCCGGA GCCGGCGGCT
TATGCGGCAA AGGCCATGGG CCCCGACGTG ATAGCAGTGC CCCTAGAAGG CCCCCGGCTA
TCGCTGAGAC ACGTTGAAAA GATCGCCTCT TTGGCGGAGA GATTTGACGT AGTGGCTATT
GGCCCCGGCC TCGGCACAGA GGGGGAGACC CCAGACGCCG TTAGGGAAAT CTTCAAGAGG
CTCGCCGGCA GAAAACCGCT GGTGGTAGAC GCAGACGCCT TGAAGGCGCT AAGGGGCGAA
AAGGCGGCGG GGGTTACTAT CTATACGCCC CACGCCGGGG AGTTCAAGGC GCTTACGGGA
ATTGAGCCGC CTGAAGCCCT TAGGGAGAGG GCAGAGGTCG TGAAGCAACA AGCCGCATCA
ATAGGCGCAG TCATTTTGCT AAAGGGCAGA TACGACGTCA TATCCGACGG GGTCAAGGTG
AAGATAAACG CCACAGGCAC CCCCGCCATG ACTGTCGGCG GCACCGGCGA CGTGTTGACG
GGGCTAGTCG CGGCGTTTTT GACAAAAACT ACAAACCCTC TAGAGGCGGC AGCGGTGGCC
GCCTTCGTCA ATGGGCTGGC AGGAGAGGAG GCGGCCGCCC AGTTATGCTT CCACATCACC
GCAAGCGACC TCCTGGACAA GATACCGGGC GTAATTAGGA AATTTGCAAG AGAGGAGGTC
ACCCACGCCT CCTCGAGAGC TTAA
 
Protein sequence
MDVITSLEMY VADRNAEWLG VPRLVLMENA GAAVARNILK KYPHASRVLA ICGTGDNGGD 
GYVAVRHLHA AGKEVRVIAL GEPREELAAR NYHAVRRLWG VEVAVVQSPL ELLALQDWLM
WADVIIDAVL GTGIRGALRE PHATAIELMN IAPAPKVAVD IPSGLDPDTG EVRDKAVKAA
LTVTFHKAKK GLLAPSAARY VGELVVEPIG IPPEAEVIVG PGDFAYLNFS RRADSKKGDH
GRVLVVGGSL EYSGAPVFVA KAALRAGVDL AVIAAPEPAA YAAKAMGPDV IAVPLEGPRL
SLRHVEKIAS LAERFDVVAI GPGLGTEGET PDAVREIFKR LAGRKPLVVD ADALKALRGE
KAAGVTIYTP HAGEFKALTG IEPPEALRER AEVVKQQAAS IGAVILLKGR YDVISDGVKV
KINATGTPAM TVGGTGDVLT GLVAAFLTKT TNPLEAAAVA AFVNGLAGEE AAAQLCFHIT
ASDLLDKIPG VIRKFAREEV THASSRA