Gene TM1040_1123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1123 
SymbolargC 
ID4077244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1208651 
End bp1209679 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content59% 
IMG OID638006427 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_613118 
Protein GI99080964 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATA ACGTGGCAAT TCTTGGTGCG TCCGGCTATA CCGGCGCAGA GCTGATCCGG 
CTGATTTCTC AGCACCCGAG CATCACCATC AAAGCCCTCG CAGCCGAGCG CAAGGCAGGC
ATGGAGATGG CAGATGTATT CCCGCATCTG CGTCATCTTT CGCTGCCAAC CCTTTGCAAA
ATAGACGAAA TCGACTTCGC ACAGATCGAT CTGTGTTTCT GTGCTCTGCC GCATAAGACC
AGCCAAGAGG TGATCGCGAA ACTGCCGGGT GATCTGAAAA TCGTGGATCT GTCGGCGGAC
TTCCGGCTGC GCGACCCGGA GGCCTATGAA AAATGGTACG GCAACCCGCA TGCGGCGCTC
GAGCAGCAGC AGGAGGCGGT CTATGGTCTC ACCGAATTTT ATCGCGATGA GATTAAGGGC
GCGCGGCTGG TGGCGGGCAC GGGCTGCAAT GCGGCCACCG GGCAGTTTGC GCTGCGGCCG
CTGATCGCGG CGGGTGTGAT CGACCTTGAT GAGATCATCC TCGATATGAA ATGTGCGGTC
TCCGGCGCCG GGCGGGCGCT CAAGGAAAAC CTGCTGCATG CGGAGCTGAG CGAAGGCTAT
AACGCCTATG CCATTGGTGG CACCCACCGG CATATCGGCG AGTTCGATCA GGAGTTCTCG
GCTATCGCTG GGCGGCCCGT GAAGGTCCAG TTCACCCCGC ATCTGCTGCC GGTGAACCGG
GGGATCCTTG CGACCACCTA CGTCAAAGGC GATGCTCAGG CGATCTATGA GACATTTGCC
AAGGCCTATG CCGATGAGCC CTTTGTCGAG CTGCTGCCTT TTGGCGAGGC GCCTTCCACC
CATCATGTGC GCGGGTCAAA CTTTGTACAT ATCGGGGTCA CTGCCGATCG CATCGCAGGG
CGCGCAATTG TTATCGTGGC GTTGGATAAC CTGACAAAAG GTAGCAGCGG TCAGGCCTTG
CAGAATGCAA ACCTGATGTT AGGTGAGGAC GAGACAGCCG GGCTGATGAT GGCACCGCTG
TTCCCCTGA
 
Protein sequence
MTHNVAILGA SGYTGAELIR LISQHPSITI KALAAERKAG MEMADVFPHL RHLSLPTLCK 
IDEIDFAQID LCFCALPHKT SQEVIAKLPG DLKIVDLSAD FRLRDPEAYE KWYGNPHAAL
EQQQEAVYGL TEFYRDEIKG ARLVAGTGCN AATGQFALRP LIAAGVIDLD EIILDMKCAV
SGAGRALKEN LLHAELSEGY NAYAIGGTHR HIGEFDQEFS AIAGRPVKVQ FTPHLLPVNR
GILATTYVKG DAQAIYETFA KAYADEPFVE LLPFGEAPST HHVRGSNFVH IGVTADRIAG
RAIVIVALDN LTKGSSGQAL QNANLMLGED ETAGLMMAPL FP