Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0371 |
Symbol | |
ID | 3784563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 403646 |
End bp | 404857 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637810447 |
Product | UDP-glucoronosyl and UDP-glucosyltransferase family protein |
Protein accession | YP_411071 |
Protein GI | 82701505 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.105227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACCG GAAATAAGCC GATTATTCTT CTAGTGGCTG AAGCAGTCAC CCTTGCGCAT TTCGGTCGAA TCGTGACGTT GGCGAGAGCC CTTGATTCTA ACAAGTATGA AGTGGTTGTC GCCTCGGATC CCCGTTATCT TGATTTAGAC GCGCCACTGA ACTGCACGTT CCATCCTATT CGGTCTATTC CCTCAGCCGA ATTTGCGCTA GCCCTTGCTC GGGGAAAACC AGTGTATAGC CTTGAAACCC TCTCCCGTTA TATCGAAGAT GATCTGGCGC TGCTTGATCT CGTCAGACCT GCTCTCGTCG TGGGAGATTT CAGACTATCT CTGGCCGTGA GCGCGCCGCT ACGGAAAATT CCATTTGCCA CCGTAGTCAA CGCCTACTGG AGCCCCTATG CAGTGACCCG CTACCCTGTT CCCGATCTTC CGGTGACCCG CATTCTCGGT ATCACACTTG CGCAAAAATT ATTCGATATT ATCAGACCCG TTGCCTTCGC CCTCCATGCC AGGCCCTTGA ACCGGTTGCG TCGACGTTTC GGATTGACCC CTTTGAAGCA GGATATCAGA AACACATACA CGTGGGCGGA TTACACTTTA TACGCGGACA TCCCGGAAGT AGTGCCTGCG CTTAATTTAC CTCCCCACCA TCGCTATCTT GGTCCAATAC TCTGGTCCGC CGCCATCTCT CTGCCTGCAT GGTGGGACTG CTTACCCGAA GACAAACCGG TGGTATTTCT AAGCCTGGGG AGCTCGGGCA TGGCAGCACT TCTGCCAATG GCTCTTGCCG CCTTATCACA ACTTGCGATC ACAGTTGTCG TAGCCACTGC AAGAAAAATT ACCATAGATG AGGTTCCAGC GAATGCATAT GTTACTGATT ACCTGCCATT GGACCGGGTT GCTTCACGCT TGAAAATAGT GATCAGCAAC GGCGGCAGCC TGACTACGTA TCAGGCGCTT TCAAGCGGAG TGCCGGTGAT CGGCCTGTGT TCGAACCTGG ATCAACTGCT GAATATGAAT GCAGTGCAGC AGTTGGGGGC CGCTATTACC TTGCATTGCG CTCGGATATC CGTCACTGAC CTTGTGATGG CTGTAACCGC AATGCTGGAT AATCCATCGT ATGGACGAGC AGCAATCAAG ATAAGCCAGA TCTTGGCAGA ATCGGATGCT AAACAGCGTT TTCGGGAATT TGTGGTGCAG GTACTTCATT AA
|
Protein sequence | METGNKPIIL LVAEAVTLAH FGRIVTLARA LDSNKYEVVV ASDPRYLDLD APLNCTFHPI RSIPSAEFAL ALARGKPVYS LETLSRYIED DLALLDLVRP ALVVGDFRLS LAVSAPLRKI PFATVVNAYW SPYAVTRYPV PDLPVTRILG ITLAQKLFDI IRPVAFALHA RPLNRLRRRF GLTPLKQDIR NTYTWADYTL YADIPEVVPA LNLPPHHRYL GPILWSAAIS LPAWWDCLPE DKPVVFLSLG SSGMAALLPM ALAALSQLAI TVVVATARKI TIDEVPANAY VTDYLPLDRV ASRLKIVISN GGSLTTYQAL SSGVPVIGLC SNLDQLLNMN AVQQLGAAIT LHCARISVTD LVMAVTAMLD NPSYGRAAIK ISQILAESDA KQRFREFVVQ VLH
|
| |