Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0484 |
Symbol | |
ID | 3784901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 542214 |
End bp | 543119 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637810560 |
Product | dihydropteroate synthase |
Protein accession | YP_411184 |
Protein GI | 82701618 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.154911 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCACCAT CTTCTATTCC GCCTGTAGAA ATTTTCAGGC TCTCTCATCG CCCTCTTGTG ATGGGCGTTG TCAATATAAC TCCTGACTCC TTTTCGGATG GCGGCTTGTT CGCTTCAACC GACCATGCCT TGAACCATGC TTTCCATTTG ATGGAAGAAG GGGCGGATCT GCTGGACGTC GGCGGTGAAT CAACGCGCCC TGGCAGCACC CCGGTTTTTG TCGAAGAGGA ATTACGTCGG ATTCTCCCGG TAGTGGAAGA ACTGGCAAAT CAGAATGTGC GGGTTTCGGT CGACACTTCC AAACCCGAAG TCATGCGCGC TGCCATCGCC GCGGGGGCCG TGATGATTAA CGATGTGAGG GCACTCCAGA TGCCGGGTGC GCTGGAGGCG GTCGCAGAAG GAGGTGTGAT GGCTTGCCTC ATGCATATGC AGGGTGAACC CGCTAATATG CAGGTCAATC CCCAATATGA TGATGTAGTG CAGGATGTCA AGGTTTTTCT GAAGCGGCGG GTGGAGGCTG CCCAAGCCGC AGGTATTCCG CGGGAGCGGC TGGTAGTCGA TCCGGGCTTT GGTTTTGGTA AAAACCAGGT TCATAATATC GAGCTATTAC GGCATCTCGA TCAGTTTATC GATCTTGGAG TACCGGTACT GGTAGGTCTC TCGCGCAAAT CGATGCTGGG AAAAATCACG GGCAGTGACA TAAATAACCG GATTCATGCC AGCATAGCGG CAGCGCTGAT CGCGGCTATG AAGGGTGCGG CCATCCTCCG TGTGCATGAC GTCCGGGCAA CCCGGGATGC GCTCGCTGTT TACAATGCGG TTTACGACCG GGGCTCTGCA CGGGAGAATG CATCATCGCT CACTTCACGG TGGAAACCCT CAGTTGAATT CAATTCATTT CGATAA
|
Protein sequence | MSPSSIPPVE IFRLSHRPLV MGVVNITPDS FSDGGLFAST DHALNHAFHL MEEGADLLDV GGESTRPGST PVFVEEELRR ILPVVEELAN QNVRVSVDTS KPEVMRAAIA AGAVMINDVR ALQMPGALEA VAEGGVMACL MHMQGEPANM QVNPQYDDVV QDVKVFLKRR VEAAQAAGIP RERLVVDPGF GFGKNQVHNI ELLRHLDQFI DLGVPVLVGL SRKSMLGKIT GSDINNRIHA SIAAALIAAM KGAAILRVHD VRATRDALAV YNAVYDRGSA RENASSLTSR WKPSVEFNSF R
|
| |