Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0241 |
Symbol | |
ID | 3785727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 258346 |
End bp | 259338 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637810316 |
Product | hypothetical protein |
Protein accession | YP_410941 |
Protein GI | 82701375 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02925] peptidyl-prolyl cis-trans isomerase, EpsD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCAAC AAATTGATCA TATGAAAATG AATCGTCAGA AATTCATAAC GCCACTATTA CTTCCGTTGC TTGTTTTAGG ACTCGCAGCC TGCGACAAAA AAGAATCCTC ACAACCGGCC AGCCAGGTAG CGGCCAAAGT AAATTCCGGC GAAATATCCG TGCATCAACT CAATTATGTT CTGGCCAGAG CCACGCGCAA TAGCGCAAAT TCTCCGGAAA ATGCATCCAG GATTCGCCGG GAAGTTCTTG ATCGCCTGGT CGATCAGGAA CTTGCGGTAG AACAGGCTAT CAATAAAAAA CTTGACCGTT CGCCGGAAGT TCTCATGGCC CTGGATAATG CACGTAGAGA GATTCTTGCA CGTGCCTATC TCGAGCAGAT CACTGCGGCT ACTCCAAAGC CAACCGTCGA AGAAGCAAAA ACCTACTATT CGGAACATCC ACAGCTTTTC GCGGAACGGC GCATTTATAA CATACAGGAA ATTGTGCTTC CTTCATCAGC CGGCGTCGCG GACGAGCTGC GGGAAATGCT CGACTCAGGC AAACCCATGG AAGACATTGC GAAGTGGCTC AAAGGCAAGG ACATAAAGTT TGCCGCTGGC AGCGCAACAC GTTCCGCCGA GCAGATTCCC CTTGAGATCC TGCCCAAGAT ACATCCGCTG AAACCCGGCC AGGGCTTGCT CATCCAGGGT CCGCAATCGA TTACCCTCAT GCGGATCGCT GCCGCACAAA CGGCGCCGAT TACCGAAGCA GCAGCGTTGC CGCGCATCCA GCAGTTTCTT GGAAATCAGC GCGCGGCGGA AGCCGCCCGG GGGGAAATAA AGGCTCTCAA GGCAAAAGCC AATATCACCT ACATGGGCGA ATTCGCTGTT CCGGATAAAA ATGCATCCGG GGGAGAAAAT TCCGTTTCGT CAGTCGCCAG TGCTAGGGAT TCAAGTTCGC CCCAAGCCGA TAATCCCGGC CTTGAAAAGG GTGCGGCGGG TTTGTTGAGA TAA
|
Protein sequence | MAQQIDHMKM NRQKFITPLL LPLLVLGLAA CDKKESSQPA SQVAAKVNSG EISVHQLNYV LARATRNSAN SPENASRIRR EVLDRLVDQE LAVEQAINKK LDRSPEVLMA LDNARREILA RAYLEQITAA TPKPTVEEAK TYYSEHPQLF AERRIYNIQE IVLPSSAGVA DELREMLDSG KPMEDIAKWL KGKDIKFAAG SATRSAEQIP LEILPKIHPL KPGQGLLIQG PQSITLMRIA AAQTAPITEA AALPRIQQFL GNQRAAEAAR GEIKALKAKA NITYMGEFAV PDKNASGGEN SVSSVASARD SSSPQADNPG LEKGAAGLLR
|
| |