Gene Nmul_A0228 details


Gene Information

Locus tag: Nmul_A0228
Symbol:
ID: 3786310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 242379
End bp: 243566
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 56%
IMG OID: 637810300
Product: 8-amino-7-oxononanoate synthase
Protein accession: YP_410928
Protein GI: 82701362
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes
TIGRFAM ID: [TIGR00858] 8-amino-7-oxononanoate synthase
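
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 242379
END_BP = 243566


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1188 395
```

Both results agree with the Gene Length and Protein Length fields of the record.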


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00625814
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTGC TAGATAAACT GGATGCCGTG GCGGCAGCGA GAAAAGCCCT GCTGCCTGAG 
GGTGTTGAGG TTTTTGGAAC ACCCATAGAA GAAGTTTATT CCTCGACCGA GGCCAGGATC
GGGGATCACC GGATATTGTT TCTGGGAACA AATAATTATC TGGGCCTTAC CTTCGCTCAG
GAATGCCGGG ACGCAGCGCA GGAAGCGATT CACAATGAAG GCACGGGTAC AACCGGCTCG
CGCATGGCCA ATGGCAGTTA CAGCGGCCAT CGTGCCCTCG AGCGCGAGTT TGCCGAGTTT
TATCAATGCG GGTCCTGCAT CGTATTTACC ACGGGATATC AGGCCAACCT TGCAACCATC
TCAGGGCTGG CCGGGGCCGG CGACATCGTT CTTATCGATG GCGATTCCCA CGCAAGCATC
TACGATGGTT GCCGTTTGAG CGGCGCCGAA ATAATCCGCT TCAGACACAA CGATGCGGCC
GATCTGGAGA AGCGTCTCCG CCGCCTGGGA GAAAGATCGC GCTCCACCCT CATCATTGCC
GAGGGCATCT ACAGTATGCT GGGAGACCGC GCGCCGCTGG CGGATATCGT CAAGGTAAAG
GATGCGTATA ACAGCACGCT TCTTCTTGAT GAAGCGCATT CCCTGGGGGT ACTGGGTGAG
ACCGGACAGG GTCTGGTAGA AGAAACAGGC CTCCTCGATA GAGTAGACTT CATCACAGGC
ACCTTCAGTA AAAGCTTGGG GGGGATCGGC GGCTACTGTG TGAGCAATCA TCCGCAACTG
GATCAGTTGC GCTATGTGAG CCGGCCTTAT ATTTTCACTG CGTCGCCTAC GCCTGCCACC
ATCGCCTCAA CCCGTGCAGC TCTCAAGCTG CTGAGGGAGG GGGTCGAATT ACGCCGACAG
CTCTGGAAGA ATGTGCACCA GCTCTACTCG CAACTCAAGG AACTGGGTTA TCGCCTGGGA
CCGGAACCCA GCCCTGTTAT CGCAACAATC CTTGAAACGC CGCAACAGGC GCTGGCACTG
TGGAAAGGAC TGCTTGAACA GGGTATCTAT GTGAATCTGG TCTTGCCGCC CGCAACGCCG
GAGGGCAATT CGCTGGTGCG TTGCAGTGTA AGTGCCGTTC ATACCAGCGA GCAGATGGAT
CATGTCGGCA AAACTTTCGC CATGTTGCGC GAGACTATTT TCCAGTAA
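
The gene sequence is displayed in blocks of ten bases, so the spacing has to be removed before any computation. The following is a minimal sketch of how the GC content reported in the record could be recomputed, assuming GC content means the percentage of G and C bases over the full sequence; the helper names are hypothetical, and only the first displayed line is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGAGTTTGC TAGATAAACT GGATGCCGTG GCGGCAGCGA GAAAAGCCCT GCTGCCTGAG"
sample = clean_sequence(first_line)
print(len(sample), round(gc_percent(sample), 1))
```

Passing all displayed lines of the gene sequence through `clean_sequence` and then `gc_percent` gives the value to compare against the GC content field, which the record appears to round to a whole percent.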
 
Protein sequence
MSLLDKLDAV AAARKALLPE GVEVFGTPIE EVYSSTEARI GDHRILFLGT NNYLGLTFAQ 
ECRDAAQEAI HNEGTGTTGS RMANGSYSGH RALEREFAEF YQCGSCIVFT TGYQANLATI
SGLAGAGDIV LIDGDSHASI YDGCRLSGAE IIRFRHNDAA DLEKRLRRLG ERSRSTLIIA
EGIYSMLGDR APLADIVKVK DAYNSTLLLD EAHSLGVLGE TGQGLVEETG LLDRVDFITG
TFSKSLGGIG GYCVSNHPQL DQLRYVSRPY IFTASPTPAT IASTRAALKL LREGVELRRQ
LWKNVHQLYS QLKELGYRLG PEPSPVIATI LETPQQALAL WKGLLEQGIY VNLVLPPATP
EGNSLVRCSV SAVHTSEQMD HVGKTFAMLR ETIFQ
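
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library, assuming the amino acid assignments of translation table 11 (which match the standard code; the two differ only in permitted start codons) and assuming the gene starts with ATG, as it does here. `translate_cds` is an illustrative name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring display whitespace and dropping one trailing stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # No special handling of alternative start codons: assumes an ATG start.
    return protein[:-1] if protein.endswith("*") else protein


# First 18 and last 15 bases of the gene sequence, copied from the record.
print(translate_cds("ATGAGTTTGC TAGATAAA"))  # prints: MSLLDK
print(translate_cds("ACTATTT TCCAGTAA"))     # prints: TIFQ
```

The two fragments match the beginning (MSLLDK) and the end (TIFQ) of the protein sequence listed above; translating the complete gene sequence the same way allows a full comparison.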