Gene Nmul_A1964 details


Gene Information

Locus tag: Nmul_A1964
Symbol:
ID: 3784987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2258107
End bp: 2259282
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 55%
IMG OID: 637812052
Product: Orn/DAP/Arg decarboxylase 2
Protein accession: YP_412651
Protein GI: 82703085
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0019] Diaminopimelate decarboxylase
TIGRFAM ID:
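
The start, end, gene length, and protein length values above can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon; both are assumptions about this record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2258107, 2259282)
print(length)                  # 1176
print(protein_length(length))  # 391
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.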


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAATGC GTAATACCGC TCTACGGCAA CCCGTAGAAG TAGAAGAAAT ATTCGACACT 
CACCCCGAAG TCAGCCTGGA TTTTGAACAC GTTCAGGCAG CCCTCAAACA GGGCTACAGC
AAGCCATTCC TGCTGGTTGA CAGTAATATC ATCCGCAACA AGGCCCGAAG ATTCAAAGCG
GCCATGCCGC GCGTGCAGCC GCACTACGCG GTCAAGGCCA ATCCTGATCC GCGCGTATTG
AAAACGCTGA TCGAGGAAGG CGTTGGGTTT GAAATTGCCT CCATTTCAGA GCTTGACCTC
CTGCTCAGCC TGAATGTGCC TGCCGCGGAT ATTTATTACA GTAATCCGAT GAAATCCCGG
GCGTACCTGG AATATGCCGC AGCCAAAGGC GTGGAGTGGT ATGTACTGGA TAGTGTCGAG
GAGCTACGCA AGATCGTCAG CGTGAAGCCG GATGCGAAAC TGTATTTGCG GATCGATACG
CCCAACATCG GGAGCGACTG GCCGCTTGCC GGCAAGTTCG GCACGCATGT GGCCGAGATC
AAGGAGATTA TTGACGAAGC CGCGAACCTT CAGGCTGATC TCGCCGGCGT GACCTTCCAC
GTCGGATCGC AATGCCGTAA TCCGCAGAAC TGGCGGGTGG GCATCGAGCG GGCAATCAAG
GTATTCGCCG ACATGCGCCA GGCAGGATTG TCACCGCGTT TGCTCAATAT CGGCGGCGGC
TATCCGGTGC GGCATGTCAA GCCCATTCCA TCGATAGAAG TTATCGGTGA GGTCGTGAAC
GAAGCGATTG CGAACCTGCC GGAGAACATT CGCATCATGG CTGAGCCCGG GCGCTACCTC
GTCTCGGATG CGGCCTATTT TGTCTGCCGC GTAGTAGGAA CGGCCACCCG CAACGGCAAA
CGCTGGATGT ATTGGGATGC AGGCGTCTTT GGGGGCGTGA TCGAGGTCAC GGAGGGTTTA
CGGTATGAAA TTCTTTCTGA TCGCACGGGA CCGAGCATTC CCTGGTCTGT AGCAGGGCCA
ACCTGTGATT CGGTCGATAT TTTGATGCGC GATGAACTCC TGCCGGAGGA TATCGAGGAA
GGCGATTTCA TCTATATACC CAATGCCGGC GCTTATACGA CAGCCTACGC CAGTAACTTC
AACGGCTTTC CTTTGCCAGA GGTCGTCGTC CTGTAA
 
Protein sequence
MQMRNTALRQ PVEVEEIFDT HPEVSLDFEH VQAALKQGYS KPFLLVDSNI IRNKARRFKA 
AMPRVQPHYA VKANPDPRVL KTLIEEGVGF EIASISELDL LLSLNVPAAD IYYSNPMKSR
AYLEYAAAKG VEWYVLDSVE ELRKIVSVKP DAKLYLRIDT PNIGSDWPLA GKFGTHVAEI
KEIIDEAANL QADLAGVTFH VGSQCRNPQN WRVGIERAIK VFADMRQAGL SPRLLNIGGG
YPVRHVKPIP SIEVIGEVVN EAIANLPENI RIMAEPGRYL VSDAAYFVCR VVGTATRNGK
RWMYWDAGVF GGVIEVTEGL RYEILSDRTG PSIPWSVAGP TCDSVDILMR DELLPEDIEE
GDFIYIPNAG AYTTAYASNF NGFPLPEVVV L
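
The sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCAAATGC GTAATACCGC TCTACGGCAA CCCGTAGAAG TAGAAGAAAT ATTCGACACT"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applied to the full gene sequence, the cleaned string would be expected to contain 1176 bases and the cleaned protein sequence 391 residues, matching the lengths stated in the record.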