Gene Pars_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2139 
Symbol 
ID5056055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1913983 
End bp1916205 
Gene Length2223 bp 
Protein Length740 aa 
Translation table11 
GC content57% 
IMG OID640469691 
Productelongation factor EF-2 
Protein accessionYP_001154337 
Protein GI145592335 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0480] Translation elongation factors (GTPases) 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00484] translation elongation factor EF-G
[TIGR00490] translation elongation factor aEF-2 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCTG CAGTCAGGAT TGTGGAGAAG CAACTGGATG AAATTCTGGC CATTGCTAGG 
AACCCGGCGC AGATACGAAA CGCCGGCACT CTGGCGCATG TGGACCACGG CAAGACCACC
ACCACGGACT CGCTCCTCAT GGGCGCGGGG TTGCTCTCGC CGAAGGTGGC AGGGAAGGCG
CTGGCCATGG ACTTCGTCGC TATTGAGCAG CTCCGCCAGA TGACCGTGAA GGCCGCCAAC
ATATCGCTCT ACTTCGAGTA CGGTGGGAAG CCCTACTTGG TGAACTTCGT CGACACGCCG
GGACACGTCG ACTTCACAGG CCACGTCACG CGCTCGCTTA GAGTTATGGA CGGCGGCCTC
GTCGTTGTAG ACAGCGTCGA GGGGGTCATG ACGCAGACGG AGACGGTGGT CAGGCAGGCG
CTTGAGGAGT ACGTCCGCCC AGTCCTATTC ATTAATAAGA TCGATAGGCT GATAAAAGAG
CTGAGGCTAT CGCCGCAGGA GATACAGCAG AGGATCCTCA CAATAGTCAA GGACTTCAAC
GCTCTCATTG ATATGTTCGC CCCGCCGGAG TTTAAGGATA AATGGAAGGT CGATCCCGCC
AAGGGGCAGG TAGCTCTAGG CTCTGCGTTA CATAAGTGGG GCATAACCAT ACCCATGGCC
CAGAAGGCCG GTTTGAAGTT TAGCAACATA GTAGATGCCT ACGAGAAAGG GTATGTGGAC
AAGCTAGGGG AGGAGTTCCC CCTGTACAAG ACCCTCCTCA CCATGATCAT AGAGCACGTC
CCGCCGCCGA ACGTTGCCCA GAAGTACAGA ATCCCGAGGC TTTGGCGTGG CGACTTAAAC
AGCGAGGTGG GCAAGGCGAT GCTGGAAGCC GACCCCAACG GCCCGACGGT GATAGCAGTA
TCCAAGGTAA ACAAGGATCC CCACGCCGGT TTGATCGCCA CAGGGCGTGT CTTCTCGGGG
ACTATCAGGG AAGGCGACGA GGTCTACATC ATCGGGAGAA AGATGAAGAA GAAAGTCCTC
CAGACGTACA TCTACATGGG ACCCACGAGG ATAATCGTGC CTTACATGCC CGCCGGCAAC
ATAGTGGCGC TGATGGGCGT AGACGAGGCA AGGGCAGGAG ACACGCTGGT GGACCCAAGG
CTGACCGAGG TACCGCCTTT CGAGAAGATG AGGTACATTG CGGAGCCCGT GGTGACGGTC
GCAATTGAGC CGAAGAACCC GGCGGAGCTG GCAAAGCTGG TGGAGGCCTT AAAGGATTTG
GTGATTGAGG ACCCAACTCT CGACTTGAAG ATTGACCAGG AGACTGGGCA GATCCTCCTC
TCAGGCGTCG GCACGCTCCA CTTGGAGATC GCAACGTGGT TGCTTAAGGA GAGGGCTAAG
ACAGAGTTTA CGGTATCGCC GCCGTTGATA AGGTTTAGGG AGACTGTCAG GGAGAGGTCC
CAGGTGTGGG AGGGCAAGTC GCCGAATAAG CACAACAAGC TTTACTTCTA CGTGGAGCCT
CTCGACGAGA CTACGGTGGA GCTAATCGCC ACGAAGGAGA TTACAGAAGA GCAGGACCCG
AGGGAGAGGG CCAAGATTCT GAGGGAGAAG GCGGGCTGGG ACACCGACGA GGCTAGGGGC
ATCTGGGCGA TCGACGACCG CTACTTCAAC GTCATTGTGG ACAAGACGAC AGGTATCCAG
TACCTCCGCG AGATTAGGGA CTACATTGTC CAGGGCTTCC GCTGGGCAAT GGAGGCTGGG
CCGCTGGCCC AGGAGCCTAT GAGAGGGGTT AAGGTAGTTC TCGTCGATGC CGTCGTCCAC
GAGGACCCTG CCCACCGCGG ACCTGCGCAG ATAATGCCTG CAACTAAAAA CGCCATCTTC
GCCGCCGTGC TTTCAGCGAG GCCGACACTT CTTGAGCCAT TAGTGAGGCT AGACATAAAG
GTGGCCCCTG ACTACATCGG ATCTGTTACC TCCGTGCTGA ATAAGCATAG GGGCAAGATC
CTCGACATGA CTCAGCAGGA GTACATGGCC TACCTTAGGG CTGAGTTGCC GGTGCTTGAG
TCTTTCACAA TCAGCGATGA GCTCCGCGCC GCCGCAGCTG GCAAGATCTT CTGGTCTATG
CAGTTCGCGA GGTGGGCTCC TTACCCAGAG TCTATGCTTG TCGACTTCGT GAAGCAGTTG
AGGAAGAAGA AGGGGCTGAA GGAGGATATA CCAAAGCCTA CAGACTTCGT CGAGGTCTTT
TAA
 
Protein sequence
MSSAVRIVEK QLDEILAIAR NPAQIRNAGT LAHVDHGKTT TTDSLLMGAG LLSPKVAGKA 
LAMDFVAIEQ LRQMTVKAAN ISLYFEYGGK PYLVNFVDTP GHVDFTGHVT RSLRVMDGGL
VVVDSVEGVM TQTETVVRQA LEEYVRPVLF INKIDRLIKE LRLSPQEIQQ RILTIVKDFN
ALIDMFAPPE FKDKWKVDPA KGQVALGSAL HKWGITIPMA QKAGLKFSNI VDAYEKGYVD
KLGEEFPLYK TLLTMIIEHV PPPNVAQKYR IPRLWRGDLN SEVGKAMLEA DPNGPTVIAV
SKVNKDPHAG LIATGRVFSG TIREGDEVYI IGRKMKKKVL QTYIYMGPTR IIVPYMPAGN
IVALMGVDEA RAGDTLVDPR LTEVPPFEKM RYIAEPVVTV AIEPKNPAEL AKLVEALKDL
VIEDPTLDLK IDQETGQILL SGVGTLHLEI ATWLLKERAK TEFTVSPPLI RFRETVRERS
QVWEGKSPNK HNKLYFYVEP LDETTVELIA TKEITEEQDP RERAKILREK AGWDTDEARG
IWAIDDRYFN VIVDKTTGIQ YLREIRDYIV QGFRWAMEAG PLAQEPMRGV KVVLVDAVVH
EDPAHRGPAQ IMPATKNAIF AAVLSARPTL LEPLVRLDIK VAPDYIGSVT SVLNKHRGKI
LDMTQQEYMA YLRAELPVLE SFTISDELRA AAAGKIFWSM QFARWAPYPE SMLVDFVKQL
RKKKGLKEDI PKPTDFVEVF