Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0004 |
Symbol | glyA |
ID | 3786442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 5730 |
End bp | 6980 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810072 |
Product | serine hydroxymethyltransferase |
Protein accession | YP_410705 |
Protein GI | 82701139 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0112] Glycine/serine hydroxymethyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTCCT CATACAATAC TCTCGAAACC GTCGACCCCG ATCTCTGGCA AGCCATCAAA GGCGAGATGC AGCGCCAGGA AGAATATATC GAGCTGATTG CTTCGGAGAA TTATGCAAGT CCTGCAGTGA TGCAAGCGCA GGGCTCGGTG CTGACTAACA AGTATGCCGA GGGTTATCCC GGCAAGCGCT ACTACGGCGG CTGCGAGTAT GTGGATGTCG TGGAACAGCT GGCAATCGAC CGGGTAAGGG CGCTGTTCGA CGCGGAGTAT GTCAACGTCC AGCCGCATTC GGGCTCACAG GCCAACGCGG CAGTCTATCT GACCGCCTTG AAGCCGGGAG ATACTCTGCT GGGGATGTCG CTCGCGCATG GCGGGCATCT GACACATGGC GCTTCGGTCA ATCTCAGCGG CAAGATTTTC AATGCCGTTT CCTATGGCCT TCGTTCTGAT ACCGAAGAGC TGGACTATGA CGAGGTTGCG CGCCTGGCGC ACGAGCATAA GCCCAAGCTG ATCGTGGCAG GGGCTTCGGC TTACTCGCTG GTGATAGACT GGAAGCGCTT CCGTAAGATT GCCGACGATA TAGGGGCCTA TCTGTTCGTG GATATGGCGC ATTATGCGGG ACTGGTTGCG GCAGGATACT ATCCCAATCC CGTCGGCATC GCCGATTTCG TCACGAGCAC CACCCACAAG ACGCTGCGCG GTCCGCGGGG CGGGATCATC ATGGCCAGGG CCGAGCATGA AAAAGCGCTC AATTCCGCTA TTTTCCCGCA AACCCAGGGG GGGCCGTTGA TGCATGTCAT TGCCGCCAAG GCAGTAGCCT TCAAGGAAGC CGCCAGCCAG GAGTTCAAGG ACTACCAGGA ACAGGTAATC GACAATGCGC GCGTAATGGC GAAAGTGTTG CAGGAGCGCG GATTGCGCAT CGTCTCGGGG CGCACCGACT GCCACATGTT TCTGGTGGAC CTCCGCCCCA AGTATATTAC CGGCAAGCAG GCCGCCGAAT CGCTGGAAGT GGCGCATATC ACCGTCAACA AAAATGCGAT TCCCAACGAC CCGCAGAAAC CCTTCGTCAC CAGTGGGATT CGAATCGGCT CTCCCGCCAT CACCACCCGC GGCTTTGCCG AATTCGAATC CGAACAACTG GCTCATCTGA TAGCTGATGT ACTGGAAGCG CCGACCGATT CCTCGGTGCT CACGGAGGTT GCACGCCAGG CAAAAGCGCT ATGTGCAAAA TTTCCGGTTT ACCAAGGGTA A
|
Protein sequence | MLSSYNTLET VDPDLWQAIK GEMQRQEEYI ELIASENYAS PAVMQAQGSV LTNKYAEGYP GKRYYGGCEY VDVVEQLAID RVRALFDAEY VNVQPHSGSQ ANAAVYLTAL KPGDTLLGMS LAHGGHLTHG ASVNLSGKIF NAVSYGLRSD TEELDYDEVA RLAHEHKPKL IVAGASAYSL VIDWKRFRKI ADDIGAYLFV DMAHYAGLVA AGYYPNPVGI ADFVTSTTHK TLRGPRGGII MARAEHEKAL NSAIFPQTQG GPLMHVIAAK AVAFKEAASQ EFKDYQEQVI DNARVMAKVL QERGLRIVSG RTDCHMFLVD LRPKYITGKQ AAESLEVAHI TVNKNAIPND PQKPFVTSGI RIGSPAITTR GFAEFESEQL AHLIADVLEA PTDSSVLTEV ARQAKALCAK FPVYQG
|
| |