Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1077 |
Symbol | |
ID | 3784691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1242916 |
End bp | 1243863 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637811161 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_411772 |
Protein GI | 82702206 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.578473 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTTCT TTTCCGCATG GGTGGGGTAT AGAGTCAGCA AACCTGTTGA GCTTCCCATC GTTCCTTATG AATTTTCGAT CGAGCAAGGC AGCAGTGTAA AAACCATAGC CAGTCAACTG GCGTATGCCG GTGTTTTGCC TGACGCTTGG TCTTTTGTCC TGTTGTCGCG CTTGATGGGT GTCGCTACAT CGCTCAAGGC GGGGGACTAT GAGCTCACTG CGAGCATTTC GCCATTTCAA TTGCTACAGC GGATAACCAG GGGTGATAGC AGCCAGAGTG AAATCAGATT CATTGAGGGT TGGACGTTTT CCCAACTCAG GCGAATATTG GATGAGCACC CGGCTCTCCG TCACCAGACC ACTCACCTGA GCAATGCGGA GATTCTGCGG CTGATCGGAG CAACCGAGAC TGCGGCAGAG GGGCTGTTTT TTCCGGACAC CTATTTCTTT GCCCGCGGGA GCAGCGATGT GGCAGTATTG AAACGCGCCT ATCGCGCGAT GCGCAACCAC ATGGATAGCG CCTGGGCACA GCGGGCAGCA AACCTTCCTC TGAAAGATCC GTACGAGGCA CTGATCCTGG CATCGATCGT TGAAAAGGAA ACAGGCCGGG AGGATGATCG CGGAATGGTT GCAGCCGTGT TTATCAACCG CTTGCGGTCA CGCATGTTGC TACAGACGGA TCCGACAGTT ATTTATGGTC TGGGCGATAA ATTCGATGGC AATCTGCGTA AAAAAGATCT CTTGAGCGAC CAGGAATATA ACACTTACAT ACGTCCTGGC TTGCCACCAA CTCCCATTGC CTTGCCTGGT TTGGCTTCAA TCCGCGCCGT GCTCAACCCC GCAACGACTG ACGCGCTGTA TTTTGTTGCG AAAGGGAACG GAGAGTCGCA TTTTTCCAGC AATCTGTCTG ACCATAACCG GGCCGTTTCC AAATATCAGA AACGGTAG
|
Protein sequence | MAFFSAWVGY RVSKPVELPI VPYEFSIEQG SSVKTIASQL AYAGVLPDAW SFVLLSRLMG VATSLKAGDY ELTASISPFQ LLQRITRGDS SQSEIRFIEG WTFSQLRRIL DEHPALRHQT THLSNAEILR LIGATETAAE GLFFPDTYFF ARGSSDVAVL KRAYRAMRNH MDSAWAQRAA NLPLKDPYEA LILASIVEKE TGREDDRGMV AAVFINRLRS RMLLQTDPTV IYGLGDKFDG NLRKKDLLSD QEYNTYIRPG LPPTPIALPG LASIRAVLNP ATTDALYFVA KGNGESHFSS NLSDHNRAVS KYQKR
|
| |