Gene Pars_0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0036 
Symbol 
ID5054361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp29639 
End bp30784 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content51% 
IMG OID640467616 
Productsmall GTP-binding protein 
Protein accessionYP_001152305 
Protein GI145590303 
COG category[R] General function prediction only 
COG ID[COG2262] GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR03156] GTP-binding protein HflX 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAAATA AGGCGCTCCT AGCCTATTCG GGCCCAAAGA CCCCCAATCT TGTTTACAAA 
CTGGAGGAAT TCGCTTCGCT GGTTGAGGTT GCTGGGTTCG AAGTTTCGGA GCTGGTGACC
CAGTACGGGA GGGCGGACAC GCGGTTCTAT CTAGGAGCTG GCAAGGCGAG GGAGGTCGCC
TCTAAGGATT TCGACGTATT CATAGCGTAC CACAGCTTAA CACCCCTTCA GGTTTTCAAT
TTGGAACAGC TCTTCAAGAG GAGGGTGGTG GACAGGGTTT TTGTAATTCT GGTAATTTTT
GAGAGGAGGG CGGGTAGTAT AGAGTCTAAG TTGCAGATCG AGCTCGCTAG GCTTAGGTAC
GAACTGCCTA AGGTGAAGGA GTATTTGAGG AGGGCCAAGA TGGGGGAGCA ATTAGGCTTT
CTGGGCGCCG GCGAGTATGT AATAGACTCG TACTATCGCC ACATGGTGAG GCGCATTTCG
TCAATTAAAA AGAGGCTCGA GGAGGCGAAG AAGGGGAGGG TTATGCATAT AAAAAAGAGG
AAAGAGGCCG GAGTCCCCGA GGTTGTGTTA ACCGGCTATA CAAGCGCCGG CAAAACTACG
CTGTTTAACA GACTTGTGAG CGAGAATAAG ATCGTCGATG GGAGGCCCTT CGCGACGTTG
GAGACCTACA GCAGGGCGCT GGATATATGG GGTAAGAGAG TTGTGTTGAC AGATACGATA
GGCTTTATCG ATGATTTGCC TCCAGTCCTT ATAGAATCGT TCCACTCCAC GCTACAAGAG
ATCATAGATG CCGATAGGAT CTTGTTGGTA ATAGACGGCT CAGAGCCTTA CGAGGAGGTG
GCGCGGAAAA TCAGCACCTC GGTGAGAACA TTGGGAGAAG TAGGCGTAGA TCGCAGTAAA
ATTATCCCAA TTGTTAACAA GGTGGATAAA ATAAGGCTAG AGGAGCTGAG GAACCTGAGG
AAGGTGTTGG AAAAGTATTT CACGTGGTTT GTCCCGGTGT CCGCTCTCAC AGGCTTCGGC
ATAGAGGCGC TTAAGGCCGT CTTGTTTTTC CAAGTGCCTG GCTACACAAT TGTTAGGGCA
AGCGGCGATG GAAATCCGGT GGGGCTCCGT GTGGGCGACG TAGTTTTTGT GCCGGTAAAA
GAGTAA
 
Protein sequence
MRNKALLAYS GPKTPNLVYK LEEFASLVEV AGFEVSELVT QYGRADTRFY LGAGKAREVA 
SKDFDVFIAY HSLTPLQVFN LEQLFKRRVV DRVFVILVIF ERRAGSIESK LQIELARLRY
ELPKVKEYLR RAKMGEQLGF LGAGEYVIDS YYRHMVRRIS SIKKRLEEAK KGRVMHIKKR
KEAGVPEVVL TGYTSAGKTT LFNRLVSENK IVDGRPFATL ETYSRALDIW GKRVVLTDTI
GFIDDLPPVL IESFHSTLQE IIDADRILLV IDGSEPYEEV ARKISTSVRT LGEVGVDRSK
IIPIVNKVDK IRLEELRNLR KVLEKYFTWF VPVSALTGFG IEALKAVLFF QVPGYTIVRA
SGDGNPVGLR VGDVVFVPVK E