Gene Nmul_A1540 details


Gene Information

Locus tag: Nmul_A1540
Symbol:
ID: 3785613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1759287
End bp: 1760189
Gene Length: 903 bp
Protein Length: 300 aa
Translation table: 11
GC content: 54%
IMG OID: 637811628
Product: dihydrodipicolinate synthase
Protein accession: YP_412235
Protein GI: 82702669
COG category: [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID: [TIGR00674] dihydrodipicolinate synthase
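
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the single terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1759287
end_bp = 1760189

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the gene length.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not
# counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 903 300
```

Under these assumptions the result agrees with the listed gene length of 903 bp and protein length of 300 aa.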


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.336283
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAACG TCGCAATAAA AGGCAGTCTG GTCGCCATTG TCACCCCGAT GCATGAAAAT 
GGCGAACTGG ATCTGGAGCG CTTCCAATCC TTGATCGACT TTCATGTGAC GGAAGGGACC
GATGGTATCG TGGTGGTAGG AACCACCGGC GAATCACCGA CCGTGGATTT CGAAGAGCAC
CATCTGTTGA TCAAAACCGC AGTGGAGCAG GCGGCAGGGC GGGTACCGGT AATTGCGGGA
ACCGGAGCCA ATTCCACGCG CGAGGCAATT GACCTTTCCA TCTATGCGAA GAATGCGGGA
GCGGATGCAA GCCTGTCGGT TGTACCGTAT TACAACAAGC CCACGCAGGA GGGTTTATAC
CAGCATTTCA GAGCGGTGGC GGAGGCTGTG GATATACCGC AGATACTATA CAACGTGCCC
GGCAGGACGG TGGCAGATAT TGCCAACGAT ACGGTCCTTC GTCTTGCGCA AATTCCCAAC
ATTGTCGGAA TCAAGGATGC AACGGGTGAT ATCGGTCGCG GATTCGATCT GTTGTGCCGT
GCTCCCGAAG ATTTTGCAAT CTATAGCGGC GATGATGCCA GTGCCCTGGC TTTGTTGCTG
CTCGGCGGGC ATGGCGTTAT TTCCGTCACC GCCAACGTGG CGCCGAAGCT CATGCATGAG
ATGTGCATTG CGGCATTTGC CGGTGACCTG GCTGCTGCCC GCGCTGCAAA CAGAAAGCTT
TTGAGATTGC ATCTGGATTT ATTCATAGAG GCCAATCCTA TTCCTGTGAA ATGGGCGGTT
GCGCAAATGG GATTGATAGG CGAGGGGTTG CGGTTGCCAC TCACACCGTT GTCGAATCGA
TATCATCAGA CTCTCAGGGA AGCGATGAGC GAGGCGGGAA TCGATTTGGC GATATCTGTT
TAA
 
Protein sequence
MDNVAIKGSL VAIVTPMHEN GELDLERFQS LIDFHVTEGT DGIVVVGTTG ESPTVDFEEH 
HLLIKTAVEQ AAGRVPVIAG TGANSTREAI DLSIYAKNAG ADASLSVVPY YNKPTQEGLY
QHFRAVAEAV DIPQILYNVP GRTVADIAND TVLRLAQIPN IVGIKDATGD IGRGFDLLCR
APEDFAIYSG DDASALALLL LGGHGVISVT ANVAPKLMHE MCIAAFAGDL AAARAANRKL
LRLHLDLFIE ANPIPVKWAV AQMGLIGEGL RLPLTPLSNR YHQTLREAMS EAGIDLAISV
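
The relationship between the gene sequence and the protein sequence can be sketched with a small translator. This is a minimal illustration, not the pipeline used to produce the record: the codon table is built from the standard NCBI amino acid ordering, which translation table 11 shares with the standard code (the two differ only in the start codons they permit), and the function name `translate` is hypothetical. Whitespace between the 10-base blocks is stripped before translation.

```python
from itertools import product

# NCBI codon ordering: bases cycle as T, C, A, G with the third
# position varying fastest. Table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGGATAACG TCGCAATAAA AGGCAGTCTG"
print(translate(first_blocks))  # MDNVAIKGSL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` should allow the same comparison against all 300 residues.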