Gene Rcas_3362 details


Gene Information

Locus tag: Rcas_3362
Symbol:
ID: 5540861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4391237
End bp: 4392247
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 64%
IMG OID: 640895480
Product: N-acetyl-gamma-glutamyl-phosphate reductase
Protein accession: YP_001433430
Protein GI: 156743301
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0002] Acetylglutamate semialdehyde dehydrogenase
TIGRFAM ID: [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form
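
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 4391237
end_bp = 4392247


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1011 336
```

Under these assumptions the result agrees with the listed gene length (1011 bp) and protein length (336 aa).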


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0453282
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGCG TCGGTATTTT TGGCGCAACC GGTTATGCCG GGTATGAACT GCTCCGCTGG 
CTGCGGCGTC ACCCGGAAGT GCGCGTCGTC TTCGCCGCTT CCGAGTCGTC GGCTGGAGCA
TCGCTCGCCG ATGTCATTCC CGGTCCGCTC GATACACCAC TGATTGCGCC CGACGAAGCG
CCGCTCGGCG ATGTCGATCT GGTCTTTCTG GCGTTGCCGC ACGGCGCTGC GGGAAAGATG
GCGCGGCGCG CACGCGCGGC TGGCGTGCGT GTGATCGATT TCTCCGCCGA TTTTCGACTG
GCCACGCCCG AATCTTACCG GCGCTGGTAT GGGCACGATC ATCCCGCGCC CGAATTGCTG
CCGGCGCCCT ACGGTTTGCC GGAACTCAAC CGCGCGGCGC TGCGCGGCGC GATGCTGATC
GCTAATCCTG GCTGTTATCC GACCGGTATG CTGCTCGGTG TTGCGCCGCT TCTCATGGCC
GGCGCGTTGA CCGACCCGCT GATCATCGTC GATGCCAAGT CGGGAGTGTC GGGGGCAGGG
CGCGCGCCGA AGCAGAATAC GCACTTCGTC GAAGTGAACG AGAATCTTGC GCCGTACAGT
ATCGGGCAGG TTCACCGTCA TGTTGGCGAG ATGCGCCAGG AAGCGCAGCG GATCGCGCGC
GGCGTGGCGC CGGAGATCGT GTTCACGCCG CAGTTGCTGC CGGTGAGCCG CGGTATCCTG
AGCACGATCT ACCTGCGCAT ACCGGACGAC TGGAGTGAGG ATCGGGTGCG CGCGCTGTAC
CGTGAGCAGT ATGCTGACGA ACCATTCGTG CGGGTGCTGT CGGCGGGCGC GCTGGCGACT
CTGGGGCACA CAACAGACAC GAATGTCTGC GCTATCTCGC TGACCCTGGC GCGACCGGGG
TTGCTCATCG TTGTCTCCAG TGAAGACAAT ATGGTCAAAG GCGCTGCTGG TCAGGCGATC
CAGAACATGA ACCTGATGTT TGGGCTGGAT GAGACAACCG GATTGGTGTA G
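
The GC content field can be recomputed from a sequence printed in this 10-base block layout. This is a minimal sketch; `gc_percent` is a hypothetical helper, and it assumes the input contains only the letters A, C, G, T plus whitespace.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First 10-base block of the gene sequence above: 6 of 10 bases are G or C.
print(gc_percent("ATGATCCGCG"))  # 60.0
```

Passing the full gene sequence to the same function gives a value to compare with the GC content listed in the record.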
 
Protein sequence
MIRVGIFGAT GYAGYELLRW LRRHPEVRVV FAASESSAGA SLADVIPGPL DTPLIAPDEA 
PLGDVDLVFL ALPHGAAGKM ARRARAAGVR VIDFSADFRL ATPESYRRWY GHDHPAPELL
PAPYGLPELN RAALRGAMLI ANPGCYPTGM LLGVAPLLMA GALTDPLIIV DAKSGVSGAG
RAPKQNTHFV EVNENLAPYS IGQVHRHVGE MRQEAQRIAR GVAPEIVFTP QLLPVSRGIL
STIYLRIPDD WSEDRVRALY REQYADEPFV RVLSAGALAT LGHTTDTNVC AISLTLARPG
LLIVVSSEDN MVKGAAGQAI QNMNLMFGLD ETTGLV
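
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the database's own method: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code (the two differ in which start codons are permitted), and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGATCCGCG TCGGTATTTT TGGCGCAACC"))  # MIRVGIFGAT
```

The first ten residues match the start of the protein sequence listed in the record.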