Gene RoseRS_4052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4052 
Symbol 
ID5211035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5074074 
End bp5075087 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content63% 
IMG OID640597640 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_001278346 
Protein GI148658141 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.098628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCGCG TCGGTGTTTT TGGCGCCACC GGGTATGCCG GTTACGAATT GCTGCGATGG 
CTGCGACGCC ATCCAGAAGC GCGCGTGGTG TTCACAGCGT CCGAGTCGTC AGCCGGGGCG
TCGCTCGCCG ATGTTGTGCC GGGTCCGCTC GACGCGCCAT TGATTGCACC CGATGAAGCG
CCGCTGGCGG ATGTCGATCT GGTTTTCCTT GCGCTACCGC ATGGCGCTGC CGGGAAGATG
GCGCGACGCG CACGCGCCGC CGGGGTGCGG GTCATCGATT TCTCCGCCGA CTTTCGCCTG
ACGACGCCTG AAGCGTACCG GCGCTGGTAT GGGCATGAAC ACCCGGCGCC TGAGTTGCTG
CCAGCGCCTT ATGGTCTGCC GGAACTCAAT CGCGCCGCTC TGCGCAACGC GCCGCTAATC
GCCAATCCTG GCTGCTATCC GACCGGTGTG CTGCTCGGTA TCGCGCCGCT GCTGATGATG
GGCGCGTTGA CCGATCCGCT GATCATCGTC GATGCCAAGT CAGGGGTGTC GGGAGCGGGC
CGCGCGCCGA AACAGAATAC GCACTTCGTC GAGGTGAACG AAAACCTTGC GCCGTACAAC
ATCGGTCAGG TTCACCGACA CGTCGGCGAA ATGATGCAGG AAGCACAGCG CATCGCCTGC
GGCATAACGC CGGAGATTGT CTTCACGCCA CAACTTCTGC CAGTGAGTCG CGGCATTCTG
AGCACAATCT ACCTGCGCAT ACCGGACGAC TGGAGCGAAG ATCGGGTGCG GGCGCTGTAC
TGCGAACAGT ACGCTGACGA GCCATTCGTG CGGGTGCTGC CGACGGGCGC GCTGGCAACC
CTGGCGCACA CGACACATAC CAACGTCTGC GCCATCTCAC TGACCCTGGC GCGACCCGGG
TTGCTCATCG TGGTCTCCAG CGAGGATAAT ATGGTCAAAG GCGCAGCCGG GCAGGCGATC
CAGAACATGA ACCTGATGTT CGATCTGGAG GAGACGACCG GGCTGATGGG TTGA
 
Protein sequence
MIRVGVFGAT GYAGYELLRW LRRHPEARVV FTASESSAGA SLADVVPGPL DAPLIAPDEA 
PLADVDLVFL ALPHGAAGKM ARRARAAGVR VIDFSADFRL TTPEAYRRWY GHEHPAPELL
PAPYGLPELN RAALRNAPLI ANPGCYPTGV LLGIAPLLMM GALTDPLIIV DAKSGVSGAG
RAPKQNTHFV EVNENLAPYN IGQVHRHVGE MMQEAQRIAC GITPEIVFTP QLLPVSRGIL
STIYLRIPDD WSEDRVRALY CEQYADEPFV RVLPTGALAT LAHTTHTNVC AISLTLARPG
LLIVVSSEDN MVKGAAGQAI QNMNLMFDLE ETTGLMG