Gene Daro_1863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1863 
SymbolargC 
ID3569559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2002484 
End bp2003434 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content62% 
IMG OID637680334 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_285079 
Protein GI71907492 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01851] N-acetyl-gamma-glutamyl-phosphate reductase, uncommon form 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.000000695881 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTATA AAGTCTTTGT CGACGGCCAG GAAGGCACTA CCGGCCTGCA GATCAATGAA 
TACCTCGCCA AGCGCAGCGA TGTCGTGCTG CTCAAGATCG ACGCCGACAA GCGCAAGGAT
CTTGCCGAGC GCAAGCGCCT GATCAACGAA TCCGACGTGA CTTTCCTGTG CCTGCCGGAT
GACGCAGCCA AGGAATCGGT GTCTCTGGTC GACAACCCGA ACACCTGCGT GATCGATGCC
TCGACGGCGC ACCGTGTCAA TCCGGCGTGG ACCTTCGGCT TGCCCGAGCT GGCCAAGGAC
CAGCGCGCCA AGATCAAGGC CTCCAAGCGC ATCGCCAACC CTGGCTGCCA TGCCAGTGCC
TTCATCCTGG CCCTGCGCCC GCTGGTCGAA GCCGGCCTGC TGCCTGCCGC GACGCAGATC
GCTGCCAACT CCATCACCGG CTACTCCGGC GGCGGCAAAT CGATGATCGC CGATTACCAA
AAGGCCAGCG AAAGCCCGAC CCAACTCAAG GCGCCGCGCC CCTACTCGCT CGCCCTGGCA
CACAAGCATC TGCCGGAAAT GCAGGCCTAC ACCGGCCTGA CCGTCGCGCC GATTTTCCAG
CCCATCGTCG GACCGTTCTA CAAGGGCCTG GCCGTCACCG CCTACATTCA CCCGCAGCAG
TTCACCCGCC CGGCAACGCC GACTGACGTG CAGAAGATCA TCGCCGACTA CTACGCCGGT
GAGCCGTTCA TCCGCGTGCT GCCGGTTGAT CTCGATGCGA CCACGGAAGG CGGTTTCTAC
AATGTCGAAG CCAACAACGA CACCAACCGT GTCGACATCG CCGTCTTCGG CAATGAAGAG
CGCATGTTGA TCGTTGCCCG CCTCGACAAC CTCGGCAAGG GTGCTTCCGG TGCTGCCGTG
CAGGCGATGA ACGTGCATCT GGGCGTTGAG GAAAGCCTCG GCCTGGTCTG A
 
Protein sequence
MTYKVFVDGQ EGTTGLQINE YLAKRSDVVL LKIDADKRKD LAERKRLINE SDVTFLCLPD 
DAAKESVSLV DNPNTCVIDA STAHRVNPAW TFGLPELAKD QRAKIKASKR IANPGCHASA
FILALRPLVE AGLLPAATQI AANSITGYSG GGKSMIADYQ KASESPTQLK APRPYSLALA
HKHLPEMQAY TGLTVAPIFQ PIVGPFYKGL AVTAYIHPQQ FTRPATPTDV QKIIADYYAG
EPFIRVLPVD LDATTEGGFY NVEANNDTNR VDIAVFGNEE RMLIVARLDN LGKGASGAAV
QAMNVHLGVE ESLGLV