Gene Nmul_A2194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2194 
Symbol 
ID3786219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2492832 
End bp2493848 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content56% 
IMG OID637812281 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_412878 
Protein GI82703312 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCATAG TCATGAATAA CGGCGCCACC GAAGAGCAGA TCGAAACGGT GGTTGCAAAA 
ATTCGAAGCT TCGACCTGGA CGCGAACGTT TCGCGTGGCA CCGAGCGTAC CGTAATCGGC
GCAATTGGCA ATGAGCGCAA GCTCAGTCCC GAAATGTTCG ATACCCTGAG CGGTGTCGAA
TATTCCATGC ATATCGTCAA GCAGTACAAG ATCGTGTCGC GCGAATGGCA CAGGGATAAT
TCGATAATCA ATGTGGGAGA TGTGGCGATT GGCGGGGACC AGGTGCAGGT GATCGGAGGT
CCCTGTTCAG TGGAGACGCA AGAGCAAATG GATTCGGCGG CCCAACATGT CTCGGATGCC
GGATGCCGGC TGATGCGGGG CGGCGCCTTC AAGCCCCGTA CCAGTCCCTA TACCTTCCAG
GGCAATGGCG AAGAAGGATT GAAAATGTTC CGCAAGGCTG CGGATAAGCA CAACCTCCGG
ATCGTCACGG AATTGATGGA TGCGCGCATG CTGGACACTT TCCTCGAGTA TGACGTGGAT
GTGATCCAGA TCGGTACACG CAGCATGCAG AACTTCGAAC TTTTGAAAGA AGTGGGGCGC
ATCAACAAGC CCGTGATACT GAAGCGCGGG ATGTCCGCGA CCGTTTCCGA GTGGCTCATG
GCCGCGGAAT ACATCGCCGC GGGCGGCAAC CATAACATCA TTTTCTGCGA ACGCGGCATA
CGCACCTTTG AAACCGCTTA TCGCAACGTA ATGGATGTAA CCTGCATCCC CGTGCTAAAA
AAAGAAACGC ACTTGCCGGT AATCGTCGAT CCCTCCCATG CGGGGGGAAA GGCATGGATG
GTGCCCGCCC TGGCGCGCGC GGCAGTTGCA GCGGGAGCGG ATGGCCTGCT GGTAGAGACG
CATCCCAATC CATGCGAAGC CTGGTGCGAC GCAGACCAGG CGTTGAATCC CGAGGAATTC
CGCGATCTGA TGGGATCGCT GCAAGGAATA GCGGCAGTAA TCGGACGGAG TCTGTGA
 
Protein sequence
MIIVMNNGAT EEQIETVVAK IRSFDLDANV SRGTERTVIG AIGNERKLSP EMFDTLSGVE 
YSMHIVKQYK IVSREWHRDN SIINVGDVAI GGDQVQVIGG PCSVETQEQM DSAAQHVSDA
GCRLMRGGAF KPRTSPYTFQ GNGEEGLKMF RKAADKHNLR IVTELMDARM LDTFLEYDVD
VIQIGTRSMQ NFELLKEVGR INKPVILKRG MSATVSEWLM AAEYIAAGGN HNIIFCERGI
RTFETAYRNV MDVTCIPVLK KETHLPVIVD PSHAGGKAWM VPALARAAVA AGADGLLVET
HPNPCEAWCD ADQALNPEEF RDLMGSLQGI AAVIGRSL