Gene Pars_0212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0212 
Symbol 
ID5056412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp191229 
End bp192317 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content63% 
IMG OID640467791 
Producthypothetical protein 
Protein accessionYP_001152479 
Protein GI145590477 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2407] L-fucose isomerase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.931385 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGAGG TGGCGCGGCG CCTATTACAG TCTGTGGGGT TTTTCAAAAA CCCAGAGACC 
CCGGCGCCTG AGAAGTTTCC CCTCATCATC CACGCAACTG GCGGCACCAC GGCGGCTGCC
CTTGACCTCG TCAAGAAGAG CGGCGCCCGC GGAGCTGTGC TACTGGGTTT CGGCGAGCAC
AACAGCTTCG CCAGCGTGCT CCACGCCAAG GCCGAGATAG AGGCGCTGGG TCTGCCCGTA
GTAGCCTACC ACTGCCCCTC CTACAATCAG TGCGGAGACG TCGCGGCAAG GGCAAAGAAA
GTCGCCGACA CCGCCTCCTC GCTGGTGGGC GCCAAGGCGG TTCTAATCGG CTCGGAGACA
TACCAAGCAC AAGCCGCCCG GGAGAAGCTT GGCTGGGCTG TTGAGGTTGT GCCCCTGGAG
AAGTTCGAGG AAGCAGTAGA CGCCTCGGAG CCGGACGACG AGCTTCTGAA GCTTTTTGGG
GACGACAGAG TGGCTAAAGT AGCGACGGCC CTAGAGAAGG TCTCGGCAGG TGCGAACCTC
GTCGCAATTC AGTGCTTCCC CTTCCTCATG AAGCGGCGCT ACACCCCCTG CCTGGCCCTT
GCCCTGCTCA ACTCGAGGGG GCGAGTAGTG GCATGCGAAG GCGACTTGGC GGCGGGGCTC
GCCATGCTTA TGTCGAGGGG GCTGACGGGG TACAGCGGCT GGATAGCCAA CGTCGTGTGT
CACGGCGGCG CCGAGGCGGT CTTCGCCCAC TGCACAATAG CGCTTAACAT GGCGAAGAGC
TGGCGGATCA TGCCCCACTT CGAGTCGGGC TACCCCCACG GCCTCGCCGC CGAGCTGAAA
GAGGCGGTCT ACACCGCTGT GTCCATCTCG CCTAGGTTCA ACAAAGCCGC CCTGGGAAGG
GTGGAGGTGG TGAGGAGCGG CAACTTCTTA CAGGAGGCTT GCCGCACCCA GGCCCAGGTG
AGGTTTAGGA GGGCGGTGAA GCTGGAAGAG GAGGCCCCGG CCAACCACCA CGTCTTTACC
CCCGGCGACG TCGTGGACGA GGCCGAGGCC GTGTTGAGGC TGTTGGCGAT CCCCACGTCG
AGATATTGA
 
Protein sequence
MKEVARRLLQ SVGFFKNPET PAPEKFPLII HATGGTTAAA LDLVKKSGAR GAVLLGFGEH 
NSFASVLHAK AEIEALGLPV VAYHCPSYNQ CGDVAARAKK VADTASSLVG AKAVLIGSET
YQAQAAREKL GWAVEVVPLE KFEEAVDASE PDDELLKLFG DDRVAKVATA LEKVSAGANL
VAIQCFPFLM KRRYTPCLAL ALLNSRGRVV ACEGDLAAGL AMLMSRGLTG YSGWIANVVC
HGGAEAVFAH CTIALNMAKS WRIMPHFESG YPHGLAAELK EAVYTAVSIS PRFNKAALGR
VEVVRSGNFL QEACRTQAQV RFRRAVKLEE EAPANHHVFT PGDVVDEAEA VLRLLAIPTS
RY