Gene Pars_0172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0172 
Symbol 
ID5054330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp156272 
End bp157768 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content58% 
IMG OID640467751 
Producthypothetical protein 
Protein accessionYP_001152439 
Protein GI145590437 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.329648 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGCCTGG TTAAAGTTTT AAATACAAAA ATGTGGGCAC ATATGCGTAA CATTCCTATC 
AACAGGGTAT CTGACTACAT CTGGGAAATC CCAGCCGGAG TAAAGCCTTG TCAGAAGGTG
CCCGTGAGGA TCTACGCAGA CTCCGTACTG CTGGAGAAAA TGAAAACTGA CATGACCCTA
GAGCAGGGCA TCAACGTCGG GTGTCTTCCC GGCATTTATA AGTGGTCAAT TGTGCTCCCC
GACGCGCACC AGGGCTACGG CTTTCCCATT GGAGGTGTCG CCGCCATTGA CGCCGAGGAG
GGCGTAATCA GCCCTGGGGG CATCGGCTAC GACATCAACT GCGGTGTGAG GGTGCTTAGG
ACAAACCTCA CGGAGCAGGA GGTTAGGCCA AAGCTTAAGG AGCTGGTGGA CACGATCTTC
CGCCTCGTGC CGCCCGGGGT TGGAGGAACC GGCCATCTGA GGCTATCGCC TGGCGAGTTC
GAACGTGTTT TGGCGGAGGG GGTGGAGTGG GCGGTGCAGA AGGGCTACGG CTGGGCTGAG
GACATGGAGT ACATAGAGGA GAGGGGGTCT TGGAAGCTGG CCGACCCGTC TAAGGTCTCG
GAGAAGGCCA AGGCGAGGGG GAGGGACCAG CTGGGCACCC TGGGGTCTGG CAATCACTTC
TTGGAGATAC AGGTGGTGGA CAAGATATAC GACGAGAAGG TGGCCAAGCT CTTCGGCATA
GAGAGGGAAG GCCAGGTAGT GGTAATGATC CACACGGGGA GCAGAGGCTT CGGCCACCAG
GTGGCGACGG ACTACCTCTT GATCATGGAG AGGAAAATGA GGCAGTGGGG GCTAAACCTG
CCAGATAGGG AGCTGGCTGC AGCGCCGCTT AAGGACAAGG TGGCGGAGGA CTACATCAAG
GCCATGGCAT CTGCGGCTAA CTTCGCCTGG ACGAACCGCC ACATCATCAT GCACTGGGTG
AGGGAGGCGT TTAAGAAGGT GTTCGGCTCT ATTGAGAAAG TTGGTCTGGA GATAGTATAC
GACGTGGCTC ACAACATCGC CAAGCTGGAG GAGCACGTCG TGGACGAGAA GGGCACTGTT
AAGAAGGTGT GGGTCCACCG CAAGGGCGCC ACTAGGGCCT TCCCGCCCGG CAGGCCGGAG
ATCCCGGCGA AGTACAGAGA GGTAGGCCAG CCGGTGCTGA TCCCCGGCTC TATGGGCACA
GCCTCGTGGA TCCTCGTCGG CACTCACGAT TCCATGAGGC TGACCTTCGG CACTGCGCCC
CACGGCGCTG GGAGGGTGCT CAGCCGCGAA GCCGCCATTA GGATGTACCC GCCGCACAAG
GTGCAGGAGG AGATGTCCAA GAGGGGGATA ATAGTCAGGT CTGCTGAGAC CGAGGTGATA
AGCGAAGAGG CGCCTTGGGC CTACAAGGAC GTGGACCGCG TAGTCGAAGC CGCCCACCAA
GTAGGATTTG CTAGGAAAGT GGTCAGGCAG AGGCCTATAG GAGTAGTAAA GGGCTAA
 
Protein sequence
MCLVKVLNTK MWAHMRNIPI NRVSDYIWEI PAGVKPCQKV PVRIYADSVL LEKMKTDMTL 
EQGINVGCLP GIYKWSIVLP DAHQGYGFPI GGVAAIDAEE GVISPGGIGY DINCGVRVLR
TNLTEQEVRP KLKELVDTIF RLVPPGVGGT GHLRLSPGEF ERVLAEGVEW AVQKGYGWAE
DMEYIEERGS WKLADPSKVS EKAKARGRDQ LGTLGSGNHF LEIQVVDKIY DEKVAKLFGI
EREGQVVVMI HTGSRGFGHQ VATDYLLIME RKMRQWGLNL PDRELAAAPL KDKVAEDYIK
AMASAANFAW TNRHIIMHWV REAFKKVFGS IEKVGLEIVY DVAHNIAKLE EHVVDEKGTV
KKVWVHRKGA TRAFPPGRPE IPAKYREVGQ PVLIPGSMGT ASWILVGTHD SMRLTFGTAP
HGAGRVLSRE AAIRMYPPHK VQEEMSKRGI IVRSAETEVI SEEAPWAYKD VDRVVEAAHQ
VGFARKVVRQ RPIGVVKG