Gene Daro_3640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3640 
SymbolhemH 
ID3568285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3914242 
End bp3915174 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content64% 
IMG OID637682113 
Productphosphoribosylaminoimidazole-succinocarboxamide synthase 
Protein accessionYP_286839 
Protein GI71909252 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0152] Phosphoribosylaminoimidazolesuccinocarboxamide (SAICAR) synthase 
TIGRFAM ID[TIGR00081] phosphoribosylaminoimidazole-succinocarboxamide synthase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.00000209948 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCTC CGCTTTTCGA ATCCACCATC ACCAGCCTGC CCCTGATCAA CAAGGGCAAG 
GTCCGCGACA TCTACGCCGT CGACGCCGAC AAGCTGCTGA TCGTCACCAC CGACCGCCTG
TCCGCCTTCG ACGTCATCCT GCCGGACCCG ATTCCGCGCA AGGGTGAAGT CCTGCAGGCT
GTCGCCAATT TCTGGTTCGA CAAACTCGGC CACATCGTCC CGAATCAACT GACCGGCATC
GATCCCGAAA CCGTCGTTGC TGAAAACGAA CGTGAGCAAG TCCGTGGCCG TGCCGTCGTC
GTCAAGCGCC TGAAACCGCT GCCGATCGAA GCCGTCGTCC GTGGCTACGT GATCGGTTCC
GGCTGGAAGG ACTATCAGGA AACCGGTGCC ATCTGCGGCA TCGCGCTGCC GGCCGGCCTC
AAGATGGCCG CCAAGCTGCC CTCTCCGATC TTCACGCCGG CCACCAAGGC CGCCGTCGGT
GACCATGACG AGAACGTCTC CTTCGCCACT GCCCAGGCCA ACTGCGCCGC CGACCTCGCC
GAAGCGCTGG CCGGCACCGG CAAGAACGGT GCCGGACTGG CCGACGAAGC CCGCATCGCC
GCCATCCGCC TGTACGAAGA AGCCTCCGCC TACGCCCGTG GCCGCGGCAT CATCATCGCC
GACACCAAGT TCGAATTCGG CATCGATGCC GCCGGCACCC TGCACCTGAT CGACGAAGCC
CTGACCCCGG ATTCCTCGCG TTTCTGGCCA GCCGACCATT ATCAGGAAGG CAGCAACCCG
CCGTCCTACG ACAAGCAATA CGTCCGCGAT TACCTCGAAA CCCTGGACTG GGGAAAAGTC
GCCCCCGGCC CCAAACTGCC GGCCGACGTC ATCGCCCGCA CCAGCGCCAA GTACATCGAA
GCCTACGAAA AGCTGACCGG CAAGACGCTG TAA
 
Protein sequence
MTAPLFESTI TSLPLINKGK VRDIYAVDAD KLLIVTTDRL SAFDVILPDP IPRKGEVLQA 
VANFWFDKLG HIVPNQLTGI DPETVVAENE REQVRGRAVV VKRLKPLPIE AVVRGYVIGS
GWKDYQETGA ICGIALPAGL KMAAKLPSPI FTPATKAAVG DHDENVSFAT AQANCAADLA
EALAGTGKNG AGLADEARIA AIRLYEEASA YARGRGIIIA DTKFEFGIDA AGTLHLIDEA
LTPDSSRFWP ADHYQEGSNP PSYDKQYVRD YLETLDWGKV APGPKLPADV IARTSAKYIE
AYEKLTGKTL