Gene Pars_0966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0966 
SymbolargC 
ID5055468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp858940 
End bp860001 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content61% 
IMG OID640468522 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_001153198 
Protein GI145591196 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.170153 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000021717 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTTTGA GGACATGTAT AGTCGGGGCG TCGGGCTTTG TTGGCGGTGA GCTATTGCGC 
ATTCTTCTCC AGCACAGCGG GGTCGAGGTG GTGTGCGCCA CGTCTAGGAA GTTTAAGGGG
GAGTACATCT ACAGGGTGCA CCCCCACTTG AGGGGGTTTA CCCAGCTTAA GTTCGTGGAG
CCCACTATCG ACGCTGCGCT TAAGGCAGAC GTGGTCTTCC TCGCTCTGCC CCACGGGGAG
TCTGTGAAGT GGGTGCCCAA GCTCTACGAG TCCGGAGTCG CCGTGTTTGA CCTTAGCGCC
GATTTTAGGC TAAAGGACCC CAACGCCTAC GTGGAGTGGT ACAAGTGGCC GCAGCCCCAT
CCCTACCCCG ACTTGTTGCA GAAGGCGGTG TACGGCCAGC CTGAGCTCCA CAGGAATGAG
CTGGTCGGCG CCAAGCTGGT GGCAGTCCCC GGTTGCATGG CCACAGCCTC TATCCTCATG
CTGGCCCCCC TAGCTAAGCA CGGCTTCCTC GGCAGGGCGC CTCCCGTGGT AGACGCCAAG
ATAGGGTCAA GCGGCGCCGG CGCCGAGGGG TCCATCGTCG ATCTCCACAG CTTCCGCACC
TACGTGGTTA GGCCCTACGA GCCTGTCCAC CACCGCCACA TCGCGGAGAT TGAGCAGGAG
CTTAGCCTAC TGGCTGGGAA GAAGATCAGG GTGGCCTTTA CCCCCCACGC CGTGGATTTG
GTGAGGGGAA TCTTCGCCAC CGGCCACACC TACGTGGAGA AGCTCCCCAC CGAGGCCGAC
ATGTGGAAGA TGTACCGCGC CCTCTTCGGC GACTCGAAGT TCATTAGGAT TGTGAAGGAC
AGGCTGGGGA TCGCCCGCTA CCCCAACACC AAGTATGTTA TAGGCTCAAA TTTCGTTGAC
ATCGGCTTCG AGCTAGACCC GAGGCTCAAC AGGGTAGTCA CCTTCGCCGC GATAGACAAC
CTCGTAAGGG GCGCCGCGGG ACAGGCGGTG CAAGCCTTCA ACGTGGCCTT TGGCTTCCCC
GAAGACGAGG GCCTGAGATA CATCCCGCTG GCTCCGGTAT GA
 
Protein sequence
MTLRTCIVGA SGFVGGELLR ILLQHSGVEV VCATSRKFKG EYIYRVHPHL RGFTQLKFVE 
PTIDAALKAD VVFLALPHGE SVKWVPKLYE SGVAVFDLSA DFRLKDPNAY VEWYKWPQPH
PYPDLLQKAV YGQPELHRNE LVGAKLVAVP GCMATASILM LAPLAKHGFL GRAPPVVDAK
IGSSGAGAEG SIVDLHSFRT YVVRPYEPVH HRHIAEIEQE LSLLAGKKIR VAFTPHAVDL
VRGIFATGHT YVEKLPTEAD MWKMYRALFG DSKFIRIVKD RLGIARYPNT KYVIGSNFVD
IGFELDPRLN RVVTFAAIDN LVRGAAGQAV QAFNVAFGFP EDEGLRYIPL APV