Gene Pars_0084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0084 
Symbol 
ID5054897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp76955 
End bp78016 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content57% 
IMG OID640467662 
Producthypothetical protein 
Protein accessionYP_001152351 
Protein GI145590349 
COG category[R] General function prediction only 
COG ID[COG1341] Predicted GTPase or GTP-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAGAG TCGCGGTCCC CAAGGGCGAC ACTGCCCTTG TAAGAGGGCC GGCCGAGGTT 
CATTGCCTAG ACACGTGCAG AGTCTTCGGA GCGGTGTTCC AGCACTTCGC CGTTCCGCCC
CACAAGCAGT ACCCAGTAGA GGGCCCCGCA GTTTTTGAGC TCGAAGGCGG ATCGCTGATC
TTAGTAAAGG GATCAACGAC GCCGCAGGAC TGGGCGCAAC TGCTTGAAGG AGTTGTGGCA
TTGGTGGGGC CTACGGACTC GGGGAAAAGT AGCCTCACGA CGTATTTGCT AAATCTGCAC
GTCGCCAGAG GCAAAAAAGT CTGTGTGGTA GACGCCGACG TCGGCCAGTC CGACATAGGG
CCTCCAGGAT TCGTCGCGTA TAGTTGCACC TCGGCTCCGG TTCCCCATAT AGCGGAGCTG
GAGCCGTTTG ACGCGTACTA CGTCGGCTCT GTGAATCTCC AGGGAATGGA GGAATTGTTA
ATAGCGGGCG TAGTTCGGTG CCTCAGAAAG GCCATGGCGC AATACCCCCA CCTCGTTATT
ATAAACACGC CGGGATGGAC CACGGGAAGA GGCGTGCAGT TGTTAAGGGC GTTGGCAGAC
GCAGTGGAGC CAGAGGTTAT AAACATAGGG GAGAAGGTGT TGCCAGGCCT TGCGGTGTCG
AAGCCTCCCC ACATCTATCC AAGAGGCCCG CAGGAGAGGA AGGAGCTGAG GAACTACGCG
TTCAAGAGGC ATATCAAACC AGTTGCCAAA GTACAGATAG AGCCTGACAT AGTTGCCAAC
TGCCGGTGGG ACGGCTCACT GAACTGTCCC TGGGGGAGGT ACACACCTGC CGAGGTGAAG
GAGCCGGAGA AGAGGGGTAG GGATTATTTA GTGCCGCCGC ACTACCTGAA ACACCTGCTG
GCGGCGCTCT ACAGAGGCGG AAGACTTGCG GGATACGCAA TAGTGGAGAG GCTGGAGCCT
AAAATAGTCA TGTATTCTAC GACACACGAA TTCGACGAGG TGAGAATCGG CAAGATCAGG
CTAGACCCCC AGACCTTAGA AGAACTTGAG CCGTTGCCCT AG
 
Protein sequence
MFRVAVPKGD TALVRGPAEV HCLDTCRVFG AVFQHFAVPP HKQYPVEGPA VFELEGGSLI 
LVKGSTTPQD WAQLLEGVVA LVGPTDSGKS SLTTYLLNLH VARGKKVCVV DADVGQSDIG
PPGFVAYSCT SAPVPHIAEL EPFDAYYVGS VNLQGMEELL IAGVVRCLRK AMAQYPHLVI
INTPGWTTGR GVQLLRALAD AVEPEVINIG EKVLPGLAVS KPPHIYPRGP QERKELRNYA
FKRHIKPVAK VQIEPDIVAN CRWDGSLNCP WGRYTPAEVK EPEKRGRDYL VPPHYLKHLL
AALYRGGRLA GYAIVERLEP KIVMYSTTHE FDEVRIGKIR LDPQTLEELE PLP