Gene Information |
Locus tag | Nmul_A1912 |
Symbol | |
ID | 3784150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2202271 |
End bp | 2203467 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637811998 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_412599 |
Protein GI | 82703033 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0133] Tryptophan synthase beta chain |
TIGRFAM ID | [TIGR00263] tryptophan synthase, beta subunit |
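
The coordinate, length, and protein-length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (the record does not state either convention explicitly, but both agree with the listed values). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2202271
END_BP = 2203467
GENE_LENGTH_BP = 1197
PROTEIN_LENGTH_AA = 398


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1197
print(expected_protein_length(GENE_LENGTH_BP))  # 398
```

Both results match the Gene Length and Protein Length fields, which is consistent with an unspliced, non-pseudo CDS.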
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAGTCT ATGATCTTCC GGACAGTCAT GGGCACTTTG GTCCGTATGG CGGCATCTTT GTCGCTGAAA CGCTGATCTC GGCACTGGAG GATTTACGCA TACAGTACGA GCGCTATCGC AGCGACGCCG ACTTTCAGGC AGAATTCGCC CATGAACTCA AGCATTATGT GGGACGCCCG ACTCCCATCT ATCATGCAAA ACGATGGTCT GCGCAGCTGG GGGGCGCCCA GATCCTGCTC AAGCGCGAGG ACCTGAACCA TACGGGCGCC CACAAAATCA ATAATGCGAT GGGACAGGCG CTGCTGGCAA GGCGCATGGG CAAATCGCGG GTAATAGCCG AGACCGGCGC AGGCCAGCAT GGTGTTGCCA CGGCCACGGT CGCAGCGCGG TATGGGATGG AGTGTGTCAT CTACATGGGA TCGGTCGATG TGGAGCGCCA GGCTGCCAAT GTTTACCGGA TGAAGCTTCT GGGTGCGGAG GTCATACCGG TCGAATCCGG CTCACGCACG CTGAAGGATG CTTTGAACGA GGCCATGCGC GACTGGGTGA CCAATGTTGC GGATACCTTT TACATCATCG GTACTGTGGC CGGACCTCAT CCCTATCCCA TGATGGTACG GGATTTCCAG GCGATCATCG GGCGCGAGGC CATAACCCAG ATGCAGGAGG ATTACGGACG GCAGCCCGAT GCCCTGATCG CCTGCGTGGG GGGGGGGTCA AATGCCATCG GGCTTTTTTA TCCCTACCTC GACAGCAGCA TCCGGATGAT CGGAGTGGAA GCGGCCGGAC ACGGCGTGGA GACGGATCAG CATGCCGCTA CCCTGACGAA GGGACGCCCT GGCGTGTTGC ACGGCAATCG CACTTACCTC ATCCAGGATG AGAATGGGCA GATCGTCGAA ACACATTCCA TTTCGGCAGG CCTGGATTAT CCCGGCGTAG GGCCGGAGCA TGCCTGGCTC AAGGACAGCG GACGGGCCGA ATATTTCGGA ATAACCGACG AGCAGGCTCT GGAAGCCTTC CACGCACTGT GCCATTACGA AGGGATCATT CCCGCGCTCG AATCCAGTCA TGCGCTGGCT TATGCAGCCC GGCTCGCGCC TGCCCTCACT TCCGACAAGT TGCTGCTGGT GAACCTCTCG GGGCGTGGGG ACAAGGATAT GCCCACCGTG GCGCGCGCTT CGCATATCAC CTTCTAA
Protein sequence | MKVYDLPDSH GHFGPYGGIF VAETLISALE DLRIQYERYR SDADFQAEFA HELKHYVGRP TPIYHAKRWS AQLGGAQILL KREDLNHTGA HKINNAMGQA LLARRMGKSR VIAETGAGQH GVATATVAAR YGMECVIYMG SVDVERQAAN VYRMKLLGAE VIPVESGSRT LKDALNEAMR DWVTNVADTF YIIGTVAGPH PYPMMVRDFQ AIIGREAITQ MQEDYGRQPD ALIACVGGGS NAIGLFYPYL DSSIRMIGVE AAGHGVETDQ HAATLTKGRP GVLHGNRTYL IQDENGQIVE THSISAGLDY PGVGPEHAWL KDSGRAEYFG ITDEQALEAF HALCHYEGII PALESSHALA YAARLAPALT SDKLLLVNLS GRGDKDMPTV ARASHITF
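
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping, compute GC content, and translate a CDS using translation table 11. It is a sketch under stated assumptions, not the database's own method: the helper names are hypothetical, the codon table is built from the standard NCBI amino-acid string (which table 11 shares with table 1), and the alternative start codons are those listed for table 11, read as Met at the first position. That last rule is why the leading GTG here corresponds to M in the protein sequence.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for table 11; an initiator codon is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codon read as Met
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
first_blocks = "GTGAAAGTCT ATGATCTTCC GGACAGTCAT"
print(translate(first_blocks))  # MKVYDLPDSH
print(f"{gc_content(first_blocks):.0%}")
```

Passing the full Gene sequence field to `translate` and `gc_content` in the same way allows the Protein sequence and GC content fields to be checked against the nucleotide sequence.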