Gene Information |
Locus tag | Nmul_A0541 |
Symbol | |
ID | 3784724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 623868 |
End bp | 625124 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637810623 |
Product | diaminopimelate decarboxylase |
Protein accession | YP_411241 |
Protein GI | 82701675 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0019] Diaminopimelate decarboxylase |
TIGRFAM ID | [TIGR01048] diaminopimelate decarboxylase |
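The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record: Start bp 623868, End bp 625124.
gene_len = span_length(623868, 625124)
print(gene_len, protein_length(gene_len))  # prints "1257 418"
```

Both results agree with the Gene Length (1257 bp) and Protein Length (418 aa) fields listed in the record.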
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGGTT TTCCTTCCTT CAGTTACCGC AATGGGCATT TGTGGGCAGA GTCGGTGCAA CTGGAGCGCA TCGCGCTTGA ATTCGGCACG CCCTGCTATG CCTATTCGCG CGCGGCGCTG ACTACAGCCT ACTGTGAATT CGACGATGCC TTCGCAGGAC GCGATCATCT GGTCTGTTAC GCCGTCAAGG CGAATTCCAA CCTTGGCATT CTCAATCTGC TCGCCCGCCT GGGTAGCGGC TTCGACATCG TCTCCGGGGG AGAACTTCAG CGGGTATTGA AGGCGGGGGG CGATCCACAG AAAATCGTTT TTTCGGGTGT GGGCAAGCAG CCTGCGGAAA TGAAAGCGGC GCTCGAAGCG GGCATACTGT GCTTCAACGT GGAATCGGAA GCGGAGTTGC ATGGGCTGAA CCGCGTGGCC GGAGAAATGG GCAAGGTTGC GCCGGTGAGC CTGCGGGTGA ACCCGGATGT GGATGCGAAA ACGCACCCAT ATATTTCCAC CGGACTCAAG GAAAACAAGT TCGGCATTCC TTTCGATGAG GCGGAGGCGC TTTATGCTTC TGCGCAGGCG CTTGGCAATG TACGGGTGGC CGGTCTCGAT TGCCATATCG GCTCGCAATT GACCGAACTG GCGCCTTTCA TCGAAACCTG CAAGAAGATG CTGGGGCTGC TCGACCGGCT GGAGGCGCAA GGGTTGGAGA TCGAGCACCT CGATCTGGGC GGCGGCCTGG GGATACGTTA TGCCGGGGAA AATCCCCCCT CGGCACGGGA ATATGTCGAG GCGTTACGCA GCGTGGTGGG CAACCGCAGG CAGAAAATCC TGATCGAACC GGGAAGATCG CTGGTGGGCA ATGCGGGTGT ACTGCTTACA ACCGTCGAGT ATCTCAAGCC TACGCCGCAT CGCGATTTCG CCATTGTCGA TGCAGCCATG AACGACCTGA TGCGCCCGGC CTTATATAAG GCTTACCATG AAATCCTTCC GGTGGTTGCG CGCAACGAAA TGGATGCAAA AACCTATCAG GTTGTCGGTC CGGTATGCGA AACCGGCGAT TTTCTTGGGC ATGACCGCCA CCTGGCGCTG GCGCAGGGTG ATCTGCTCGC TGTCATGTCG GCCGGCGCTT ATGGCATGAG CATGAGCTCT AATTACAATG CGCGTCCACG CGCCGCCGAG GTGATGATCG ATGGCGACCG CATTCATCTC ATTCGCGAGC GGGAATCGGT CGAGCAATTG ATGGCAGGCG AGAAAATCCT CCCATAG
|
Protein sequence | MSGFPSFSYR NGHLWAESVQ LERIALEFGT PCYAYSRAAL TTAYCEFDDA FAGRDHLVCY AVKANSNLGI LNLLARLGSG FDIVSGGELQ RVLKAGGDPQ KIVFSGVGKQ PAEMKAALEA GILCFNVESE AELHGLNRVA GEMGKVAPVS LRVNPDVDAK THPYISTGLK ENKFGIPFDE AEALYASAQA LGNVRVAGLD CHIGSQLTEL APFIETCKKM LGLLDRLEAQ GLEIEHLDLG GGLGIRYAGE NPPSAREYVE ALRSVVGNRR QKILIEPGRS LVGNAGVLLT TVEYLKPTPH RDFAIVDAAM NDLMRPALYK AYHEILPVVA RNEMDAKTYQ VVGPVCETGD FLGHDRHLAL AQGDLLAVMS AGAYGMSMSS NYNARPRAAE VMIDGDRIHL IRERESVEQL MAGEKILP
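The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before computing lengths or base composition. A minimal Python sketch, assuming the blocks are separated only by whitespace; the helper names are hypothetical and the example string is just the first two blocks of the gene sequence:

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace that splits a sequence into display blocks."""
    return "".join(seq.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (block spacing is ignored)."""
    dna = strip_blocks(dna).upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence in this record.
first_blocks = "GTGAGCGGTT TTCCTTCCTT"
print(len(strip_blocks(first_blocks)))  # prints 20
print(gc_fraction(first_blocks))        # prints 0.5
```

Applying the same helpers to the full gene and protein sequences should allow them to be compared against the Gene Length, Protein Length, and GC content fields of the record.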