Gene Pars_1877 details


Gene Information

Locus tag: Pars_1877
Symbol:
ID: 5055729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1679996
End bp: 1681219
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 51%
IMG OID: 640469423
Product: nucleotidyl transferase
Protein accession: YP_001154080
Protein GI: 145592078
COG category: [J] Translation, ribosomal structure and biogenesis
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon)
TIGRFAM ID:
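
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    # Assumption: 1-based, inclusive coordinates on the replicon.
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    # Assumption: unspliced CDS whose final codon is a stop codon.
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1679996, 1681219)
print(length_bp)                  # 1224
print(protein_length(length_bp))  # 407
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.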


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAAGAA TTGCCATCAT CCCTGTAGGG GGAGAGGCGG TTAGGCTCAG ACCCCTCACC 
GCGGAGACCT CGAAGGCTAT GGTCCGCTTC CTTAATCGGC CTTTGATAGA GCTTTCAATT
TTGCACCTCG CCCGCCAGGG GGTTGAGGAG TTCTACTTCG GCGTGAAGGG GTATTACAAC
TACCGTGACA TATACGATTA CTTCCGCGAG GGGAGGTGGT TTGCGGAGAA ATATGGCGTT
GGTATAAGGG TCAGGTATAT GCCGAGAGTA GAAACGCGGG GGAATGCGGA GGCTGTTTAT
GCCACCCTCA TGTATTATGA TATTAGGGAA CCCGTCTTGG TTATTCAGGG CGACAACGTT
TTCCAGCTTG ATGTGAAAGA TATGTACGAG TTCCATAGAT CAAAGAAGGC ATTTATAACA
ATCGCGCTTA AGGAGGAGAC AAGCGATCTG AGCGAATTCG GTGTGGCGGC GATTGGTGAG
GATATGCGCA TTTTGAAGTT TGTCGAAAAA CCAAAAAGAA GAGAAGATGC GCTTAGTAAT
TTAGTAAACA CCGGTATATA CCTCCTCTCT GAGGACTTCA AAGACTTCTT CAGCGGGGAG
ATGGGAAGCA AGTTGTACTC CGAGGGGAGG CTGGACTTCG GCGGTGACGT CATACCTACT
GTGATAGAGG CCGGCTTGCC TGTGTACGGT TATACAAGTA GGGGCTACTG GTTCGACGTG
GGGACACCCG AGCGCTACCT CAAGGCAGTC CAGTTCCTTC TTAGACAACT AACGCCGGGG
GAGCTAGAGG CGGAGGAGAT ACTGCCGTCG GTATACGCGC AGGGGGTTAG CGAACAGTCG
AAGATATTAA AAGGCAAAAT CGCGGAGCGT ATTAAAAAAG GCGCGATTAA AGCAGAGGGC
CACATCCTAC TGGGAAGACA CGTCCAACTC GGCGACAACG TCCACATACG CGACTCGGTT
ATCGACAACT ACGTAGTAGT AGGTGACAAC AGCACGATAG AGGATTCGGT AGTGATGGAC
CGGTCGCTGA TAGGGAGAAA TGTGACGATA AGACGGTCAA TAATCGGGCG CCACGTCTAC
GTGAAAGACG GGTCTGTTAT AGAAGACTCA GTCGTAGCAG ACAACGTGGT GGTGGGCGAA
GAGGCCTCTC TGAGAAGGGT AAAGGTGTGG CCGCACAAGA CGCTGGAAAA GGGAGTGAGA
CTTGAGGGCT TTTCTCTGAT CTAA
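
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before its length or GC content can be recomputed. The following is a minimal sketch of that step; the helper names are hypothetical, and the example runs on the first line of the sequence only, so it does not reproduce the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    # Drop the spaces and line breaks that separate the 10-letter blocks.
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    # Percentage of G and C bases among all bases in the sequence.
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_line = "ATGGTAAGAA TTGCCATCAT CCCTGTAGGG GGAGAGGCGG TTAGGCTCAG ACCCCTCACC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this first line only
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives a value to compare with the GC content field.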
 
Protein sequence
MVRIAIIPVG GEAVRLRPLT AETSKAMVRF LNRPLIELSI LHLARQGVEE FYFGVKGYYN 
YRDIYDYFRE GRWFAEKYGV GIRVRYMPRV ETRGNAEAVY ATLMYYDIRE PVLVIQGDNV
FQLDVKDMYE FHRSKKAFIT IALKEETSDL SEFGVAAIGE DMRILKFVEK PKRREDALSN
LVNTGIYLLS EDFKDFFSGE MGSKLYSEGR LDFGGDVIPT VIEAGLPVYG YTSRGYWFDV
GTPERYLKAV QFLLRQLTPG ELEAEEILPS VYAQGVSEQS KILKGKIAER IKKGAIKAEG
HILLGRHVQL GDNVHIRDSV IDNYVVVGDN STIEDSVVMD RSLIGRNVTI RRSIIGRHVY
VKDGSVIEDS VVADNVVVGE EASLRRVKVW PHKTLEKGVR LEGFSLI
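
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than using a library; translation table 11 assigns the same amino acids as the standard genetic code and differs only in the start codons it permits, so this simplified version assumes the CDS begins with ATG and contains only A, C, G and T. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    # Remove block spacing, then read codons until the first stop codon.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGTAAGAA TTGCCATCAT CCCTGTAGGG"))  # MVRIAIIPVG
print(translate("CTTGAGGGCT TTTCTCTGAT CTAA"))        # LEGFSLI
```

The two calls use the first 30 bases and the last 24 bases of the gene sequence, and their output matches the first ten and last seven residues of the protein sequence.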