Gene Pars_1914 details


Gene Information

Locus tag: Pars_1914
Symbol:
ID: 5055272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1719320
End bp: 1720609
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 52%
IMG OID: 640469460
Product: hypothetical protein
Protein accession: YP_001154113
Protein GI: 145592111
COG category: [R] General function prediction only
COG ID: [COG1341] Predicted GTPase or GTP-binding protein
TIGRFAM ID:
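
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1719320, 1720609)
print(length)                  # 1290
print(protein_length(length))  # 429
```

Both results agree with the Gene Length and Protein Length fields of this record.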


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACTAC GCGTCATGAA AGCTGGCGAT ATTTACAGAA TTGAGGGTCC CGCCAAGGTG 
GTCGTTAAAC GTGGCCAAAT ATACGCAACA GGAGTGGTGT ACACCGAGGG GCAGAGCTTC
ACCGTGTTAA GAGCACGGCA ACTTGCTGTG AAAGCTGTGG CTGACTCCGA GGTGGAGCTT
GTTTTGGGCC CGGGGGCTCT TCTGGAAAGA GTGTCGCCTG GCGAGGAGAT TATTGACGAG
TGGGAGAGAA GGATCTCCGG CGTGGATCCC AAGGGGGTTG TGGTTATCGT GGGGATGATG
GATGTGGGGA AATCTACAAT GACTGCGATG CTTGGGAACA AGGCGCTGGC TAGGGGGTAC
AAAGTTGTGA TTATAGACGC CGATGTTGGG CAGAACGATT TAGGTCCCCC TACCACCATA
TCGTTGGCTA GGTTAACGAA GTATGTAACT CATTTAAGAC AACTAGTCGC CGAGAAGAGC
TTATTTCTCC AGTCTACCAG TATGGAGAGG ATATGGCCCA GGGCGGTGGC GCAAATAGCG
AAGGCTGTGG AGTATGCCAA AAAGACGTGG CAACCGGATA CTATCATTGT GAACACCGAC
GGTTGGGTCC TCGACGAGGA GGCAGCAACT TTTAAACGGA GGCTCATAGA GAGGCTAGCC
CCCTCGCTAA TTGTGGCAAT ACAGGTGGAG AACGAGCTGG GGCCTATTCT CAACGGCTAC
AGCAACGTGT TAGTTCTCCC CCCGCCCCCG CACGTGAGGA CGCGGAGCCG TGAGGATAGG
AAAATACACA GAGAAATGGG CTACGGCAGG TACATCTTCC CGCCCGTGGA ACTCGCCGTG
TCTCTTGACA AAATTCCCCT CTGCAACTTG CCCCTCTTCC AAGGAATAGA GATGGGGGAA
GAGCTCAAGA GAATGCTTAC ACGCGCAATA GGCGTCGGTG TGTTGAGAGC CTACCAGGTG
GGGAGCAGAG TCTACGCAGT TGTGGAGGGA GGCGAGTGGG TGGTGAGACG GGTTGGCGGG
TTCCAAGTCG TTGGACTTCC TATAGATTTC GAAAAAGGCC TCTTAGCCGG CCTCGAGGAC
TCAGAAGGTT TTTTGGTAGG ACTCGGCGTA ATAAAGAAGA TTTATTACGA CAGGAAGAGA
GCTATTATCT ATACGTCAAG CGAGGTTGAG AGAAGGATAG GCGAAGTAAA ATGCATAAGG
CTGGGCTTAG TGAGGCTAGA TGACAACTTC AACGAGGTTG AAAAAGCCAC AAACATACTC
AAAGCAGAGG CTGAGCAGTC AACAACGTAG
 
Protein sequence
MSLRVMKAGD IYRIEGPAKV VVKRGQIYAT GVVYTEGQSF TVLRARQLAV KAVADSEVEL 
VLGPGALLER VSPGEEIIDE WERRISGVDP KGVVVIVGMM DVGKSTMTAM LGNKALARGY
KVVIIDADVG QNDLGPPTTI SLARLTKYVT HLRQLVAEKS LFLQSTSMER IWPRAVAQIA
KAVEYAKKTW QPDTIIVNTD GWVLDEEAAT FKRRLIERLA PSLIVAIQVE NELGPILNGY
SNVLVLPPPP HVRTRSREDR KIHREMGYGR YIFPPVELAV SLDKIPLCNL PLFQGIEMGE
ELKRMLTRAI GVGVLRAYQV GSRVYAVVEG GEWVVRRVGG FQVVGLPIDF EKGLLAGLED
SEGFLVGLGV IKKIYYDRKR AIIYTSSEVE RRIGEVKCIR LGLVRLDDNF NEVEKATNIL
KAEAEQSTT
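
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The sketch below shows one way to do that, compute a GC percentage, and translate a coding sequence. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in their permitted start codons). The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGTCACTAC GCGTCATGAA AGCTGGCGAT"))  # MSLRVMKAGD
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would give values to compare against the GC content field and the listed protein sequence.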