Gene Pars_2333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2333 
Symbol 
ID5056281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2086120 
End bp2087331 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content52% 
IMG OID640469885 
ProductS-adenosylmethionine synthetase 
Protein accessionYP_001154529 
Protein GI145592527 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1812] Archaeal S-adenosylmethionine synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.030297 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00106095 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGTAG TCGTAGAGCA GGTGGACAAA ACGCCTGTGG CTAGGCGGCT TGTGGAAATT 
GTAGAGAGGA AGGGCCAGGG CCACCCCGAC TACATAGCGG ACGGCATCTC GGAGTGGGTG
AGCCGCTATC TTTCTAGGTA CTACCTTCAG AGGTTCGGAG TCATCTTGCA CCATAATGTA
GACAAGACGC TTGTGGTAGG TGGGCAGGCC GCGCCGAGGT TTGGTGGTGG CGAAGTTTTG
CAGCCTATCT ATGTTTTGGT GTCTGGGAGG GCTACGTACG AGGTGAGGAC TAGGGATGGC
GTTGTGAAAA TCCCGTTGGG GCCTGTGGTG ATCCAGGCGG CTAGGGATTG GATTAAGCAA
CACTTTAGGT TTTTAGACCC AGACGCCCAT GTGGTGATAG ACTACCGAAT AGGCCAAGGA
TCCGCAGATC TTGTAGGAAT ATACGACTTA GGCGTTACTG GGGTTCCTCT TGCAAACGAC
ACCTCGGTGG GTGTTGGCTA CGCTCCGTTT ACTCCTCTGG AGGAGCTTGT GTATAAAACC
GAGAGGTTGT TGAATTCACG CGACTTTAAG GCAAAGTACC CGGAGGTGGG GGAGGACGTA
AAGGTTATGG GAGTGAGGGT AGGCAAGGAC GTGAGGTTGA CAGTAGCCGC CGCAATGATT
AGCAGGTTTG TCAAAGACAA GAGCCACTAC CTCTCTGTAA AGGAGGAGGT GAAAAAAGTA
ATAGAAGACC TCGCCGCGAA GATCGCCCCC GACTACAACG TAGATGTAAC TATAAACGCC
GCCGACAAGC CGGAGTTTGA CATATTCTAT CTCACAGTCA CAGGTACCTC TGCAGAGCAC
GGCGACGACG GCATGACCGG CAGGGGTAAC AGAGCCAACG GCCTTATTAC CCCTATGCGG
TCTATGTCGC TGGAGGCGGC GGCGGGCAAA AACCCGGTGA GCCACGTGGG GAAGATATAC
AACGTGGTTG CCCAACGCAT AGCTGATCGG GTTTACAAAG AAGTCAAGAA CATAATAGAG
GTGTATGTCG AGATTGTGTC GCAGATTGGT AAACCTATTA ACGAGCCTAA GATTCTAAAC
GTCGAAGTGA TTAAAGAGGG GGCGCTGACA AGCGACACAA GAAACGAAAT AGAAGCGATA
GCCAGGGAGG AGCTCCAAAG AATAACTAAG GTGACTGACT TAATTCTCAG CGGCGAGGTG
TCGCTATACT AG
 
Protein sequence
MTVVVEQVDK TPVARRLVEI VERKGQGHPD YIADGISEWV SRYLSRYYLQ RFGVILHHNV 
DKTLVVGGQA APRFGGGEVL QPIYVLVSGR ATYEVRTRDG VVKIPLGPVV IQAARDWIKQ
HFRFLDPDAH VVIDYRIGQG SADLVGIYDL GVTGVPLAND TSVGVGYAPF TPLEELVYKT
ERLLNSRDFK AKYPEVGEDV KVMGVRVGKD VRLTVAAAMI SRFVKDKSHY LSVKEEVKKV
IEDLAAKIAP DYNVDVTINA ADKPEFDIFY LTVTGTSAEH GDDGMTGRGN RANGLITPMR
SMSLEAAAGK NPVSHVGKIY NVVAQRIADR VYKEVKNIIE VYVEIVSQIG KPINEPKILN
VEVIKEGALT SDTRNEIEAI AREELQRITK VTDLILSGEV SLY