Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1914 |
Symbol | |
ID | 3784152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2204065 |
End bp | 2205120 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812000 |
Product | tRNA pseudouridine synthase A |
Protein accession | YP_412601 |
Protein GI | 82703035 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0101] Pseudouridylate synthase |
TIGRFAM ID | [TIGR00071] pseudouridylate synthase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTGTG CGAAAGCGGT GGGCAATGAC GGACTACCCA ATCAGCGCCC AATGTCACGC TGGAGAGCGT CTGCTTATAC GTCATTACTT CCGTTTCCCG AAAAGTGTTT AAAAATTCCC TGGACGTGCC GGTACAAGTA TCACCAGAAT CACGATGGTA GCTTTACCGT GAGAATCGCA ATGGTGCTGG AGTATGACGG CAGCAATTTC TGCGGCTGGC AAAGCCAGCC AGGGGGAAAT ACCGTGCAGG ATGCCGTGGA AGCGGCCTTG TCTGAAATTG CAGGCGAGGC TATCCGAGTA GTGACGGCAG GGAGAACCGA CGCAGGGGTT CATGCGATCT ACCAGGTGCT GCATTTCGAT ACTCGGGCGG AGCGACCTAT GAATGCATGG GTGCGGGGTG CAAACGCCCT GCTGCCCAGC GGCATTGCCC TGCTATGGGC ATCCCCTACT GCAGACGATT TTCATGCCCG CTACTGTGCG CTTGAGCGTT GTTACCTCTA CCTGTTACTG AACCACCCAG TGCGGCCGGG CCTTCATCAG CACCGAGTCG GCTGGTATCA CCATCCGCTC CGTCTCGAAT CCATGCAGAT GGGGGCACAA ATGCTGGTGG GCGAACACGA TTTCAGCGCC TTTCGGGCTG CTGCATGCCA GGCCAAATCC CCTGTACGCA CCCTGACAAA ACTGGAAGTT ACGCGAGTGG GAAACATGGT TGCGTTTGAG CTGCGCGCTA ATGCATTTTT GCACCACATG GTACGGAATA TCGTCGGTTG TCTGGTCTAT GTAGGTAAAG GTAAATTTCG TCCTGACTGG ATAGGGAAAC TGCTTGAAAA CGGGAAGCGC AGCGAAGCTG CACCGACTTT TTCCGCTTCC GGGTTATACT TGGCAGGGGT TGCCTATGAT GCGAGGTGGA AGCTGCCACC CTTTGTCGAG CCCCCTCTGA CCGCAATAGT GCCGGGCACA AACAGGCCGG CTATCCTCAC ATCATGGGCG ACAAGTGGCG GAAACCCAGT TGCGGGCGCA ACACCGGAAG TCAGGGATAT ATGTCGATTC GAGTAA
|
Protein sequence | MACAKAVGND GLPNQRPMSR WRASAYTSLL PFPEKCLKIP WTCRYKYHQN HDGSFTVRIA MVLEYDGSNF CGWQSQPGGN TVQDAVEAAL SEIAGEAIRV VTAGRTDAGV HAIYQVLHFD TRAERPMNAW VRGANALLPS GIALLWASPT ADDFHARYCA LERCYLYLLL NHPVRPGLHQ HRVGWYHHPL RLESMQMGAQ MLVGEHDFSA FRAAACQAKS PVRTLTKLEV TRVGNMVAFE LRANAFLHHM VRNIVGCLVY VGKGKFRPDW IGKLLENGKR SEAAPTFSAS GLYLAGVAYD ARWKLPPFVE PPLTAIVPGT NRPAILTSWA TSGGNPVAGA TPEVRDICRF E
|
| |