Gene Pars_1032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1032 
Symbol 
ID5056147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp919146 
End bp920285 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content64% 
IMG OID640468588 
Producthypothetical protein 
Protein accessionYP_001153262 
Protein GI145591260 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2407] L-fucose isomerase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.775746 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATTG AGGTGGCCGC CGCACCCAAC GTAGACGCCG AGACGCGGGA GGAGTACAGA 
TCCCTCTACC GAAAAACCCT AGGGGGCCTT GGGGAGGGGG CGAATTTCAT AGTGGTCTTG
ACGGGGGGCT CAGAGCCCGA GATCCTAGCC GCCGCCGGGG ACTACAACAT CATCCTCGCG
TGGCCCCACT ACAACTCCCT CCCAGCCGCG CTTGAGGCCG CCGCGGCGCT TAAGGAGGCG
GGGAGGTTTG CCCACGTAGT CCAGCTGGAG GGGCCAGGTG CGGAGCCGCC TAGAGACAAG
CTGGAGAGGC TCCTCCGGGT GGTGGACCTC TTGAGGAGAC CGCCCAGGCT CGGGCTTGTG
GGGTCGCCTA ATAGGTGGCT TGTGGCCTCA TGGCTGAGGG GCAAGCCAGA CGTAGTTATC
GACGAGGGGG AGGTCTACGC CCGCAGCGTG GAGAGGGACG GGGCGGACGT CGCCGAGAGG
CTTGTGAAAG GCGCAGAGCG TAGCGACTTC TCGGCCCGGG ATCTAGCTCC CATAGCCGCG
TATGCCAAGA CCCTGGCAGA GCGCGCCTCG GGGCTTGACG GAATCACCCT GGGCTGCTGG
TGCTTCGACT TCGAAGAGGT TAGGAAGAGG GGGTGGACGC CTTGCATTTC CCTCGCCCTC
CTCAACGACT GGGGGGTAAT GGCCACGTGC GAGGGGGATG TGAGGGCTCT CTACTCCGCG
GTGGTGTTAA GGCGCCTCTC CGGGAGGCCG AGTTGGATTA GCAACGTGAA CAAGATATAC
GACTGGGGCC TCTTGTTGAC GCACGACGGG GCGCCGCCAA GCTTTGGCAA ATACGCAGTG
GTCCCCCGCA TGGCTACCAA AGCCGCCGCG GCGCTTAGGG TAACGGTGGA GCCGGGGAGG
CCGGCCACCT TGCTGAGGGT TTCAGGAGAC TTGAAGAGGG CGCTTCTGCT GAAGGGGGTC
ACGGCGGAGG GGGAGAGGGT AGAGGCGTGT AGCACCCAGA TCGCTGTTAG GCTCACCGTG
GGGTCTGGAA GAGACGTCTT GAGGGCTGGG CTCGGCAACC ACTTAGCCTT CGTCCTCGAC
GACGTTTACG AAGAGACTAG GCTCTACTTG GAGCATCTAG GGGCTCAGGT CATCCCCTAG
 
Protein sequence
MPIEVAAAPN VDAETREEYR SLYRKTLGGL GEGANFIVVL TGGSEPEILA AAGDYNIILA 
WPHYNSLPAA LEAAAALKEA GRFAHVVQLE GPGAEPPRDK LERLLRVVDL LRRPPRLGLV
GSPNRWLVAS WLRGKPDVVI DEGEVYARSV ERDGADVAER LVKGAERSDF SARDLAPIAA
YAKTLAERAS GLDGITLGCW CFDFEEVRKR GWTPCISLAL LNDWGVMATC EGDVRALYSA
VVLRRLSGRP SWISNVNKIY DWGLLLTHDG APPSFGKYAV VPRMATKAAA ALRVTVEPGR
PATLLRVSGD LKRALLLKGV TAEGERVEAC STQIAVRLTV GSGRDVLRAG LGNHLAFVLD
DVYEETRLYL EHLGAQVIP