Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1510 |
Symbol | |
ID | 3786096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1726369 |
End bp | 1728066 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637811598 |
Product | hypothetical protein |
Protein accession | YP_412205 |
Protein GI | 82702639 |
COG category | [S] Function unknown |
COG ID | [COG0397] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAAA GCAATCTTCA GAGGAGTATG CCGATCGTCA CGCTGCCGGA TCTGTTCGAT GCTCGGTTCG ACAACCGCTT TGTGCGCCAG CTGCCAGGCG ATCCGGAAAC CCGGAATGTT CCCCGTCAGG TGCGCAATGC CGGTTATACA CAAGTGAGTC CTACGCCTGT CCGCTCACCG CGACTTCTCG CCTGGGCGGA TGAAGTCGGC GAAATGCTCG GTATTGCCCG GCCGGCATCT CCCGTTTCCC CAGCGGTGGA AGTGCTTGCC GGTAACAGAA TCCTTCCGTC CATGCAGCCT TATGCAGCAC GCTATGGCGG ACACCAGTTC GGGCACTGGG CAGGGCAGCT TGGCGACGGG CGCGCTATCA CCCTGGGGGA GTTGATCAGC CCCAACGATA AGCGCTACGA GCTACAACTC AAGGGTGCAG GGAAAACGCC CTATTCACGC ACCGCGGATG GACGTGCGGT CCTGCGTTCT TCCGTACGCG AGTTTCTGTG CAGTGAGGCG ATGCACTCCC TCGGGGTGCC TACTACGCGG GCATTGAGCC TGGTAGCGAC AGGGGAAGCG GTGATACGCG ATATGTTTTA CGACGGACAT CCGGGGGCGG AACCCGGCGC GATCGTCTGC CGCGTCTCGC CCTCGTTCCT GCGCTTTGGC AATTTCGAGA TACTTGCGGC CCAGAAGGAG CCAGAACTTC TCAGGCAGCT CGCCGACTTC GTGATAGGGG AACATTTTCC GGAACTGGCC TCGTCCCATC GGCCACCTGA AGTTTATGCG AAATGGTTCG AGGAGGTTTG CCGCCGCACA GGTATCCTCG TCGCCCACTG GATGCGGGTC GGTTTCGTCC ACGGCGTGAT GAATACCGAC AATATGTCCA TATTGGGGCT GACCATAGAC TATGGTCCTT ATGGGTGGCT CGAAGGTTTC GATCTGCACT GGACGCCTAA TACGACTGAC GCACAGGGGC GGCGTTATTG CTACGGTAAC CAGCCCAAGA TCGCGCAGTG GAATCTGACT CGCCTGGCTG GCGCGTTGAC ACCCCTGATA GAAGATGATG CTGCGCTGGA GCATGGGTTG GCAGTCTTCG GTGAAACATT CAATAACACA TGGAGTGGCA TGCTGGCCGC CAAGCTCGGG TTGGCCTCAC TCGAACACTC CGACGATGAC TCGCTTTTGA GCGATCTATT CGAAACGCTG CAACAGGTTG AGACGGATAT GACATTGTTC TTTCGCTGCC TGATGAACAT TCCTCTGAAT CCGATCTCCG GAAACAGGGC AACAACCTTC CCTGCTCCAG AGAACCTGGA AAGTGTGGAT CAAATGAATG ATCATGGACT GGTCGAGCTT TTCCGCCCGG CATTTTACGA CGCGCATCAG GCATTTTCCC ATGCGCACCT CACACGACTG GCCGGCTGGC TGCGACGCTA TATCGCAAGG GTGCGCCAGG AAGGGGAACC TGAAGGCCTG CGTTACCATC GCATGAGCCG TGCAAATCCG AAATACGTAC TACGCAACTA TCTGGCTCAG CAGGCAATAG AAGCGCTGGA GCGGGGGGAT GATTCCGTGA TAATACGGTT GATGGAAATG CTGAAGCACC CTTATGACGA ACAGCCCGAG CACGAGGATC TTGCGGCAAG ACGTCCAGAG TGGGCCCGTA ATAAGCCCGG CTGCTCCGCT TTGTCGTGCA GCTCCTGA
|
Protein sequence | MSQSNLQRSM PIVTLPDLFD ARFDNRFVRQ LPGDPETRNV PRQVRNAGYT QVSPTPVRSP RLLAWADEVG EMLGIARPAS PVSPAVEVLA GNRILPSMQP YAARYGGHQF GHWAGQLGDG RAITLGELIS PNDKRYELQL KGAGKTPYSR TADGRAVLRS SVREFLCSEA MHSLGVPTTR ALSLVATGEA VIRDMFYDGH PGAEPGAIVC RVSPSFLRFG NFEILAAQKE PELLRQLADF VIGEHFPELA SSHRPPEVYA KWFEEVCRRT GILVAHWMRV GFVHGVMNTD NMSILGLTID YGPYGWLEGF DLHWTPNTTD AQGRRYCYGN QPKIAQWNLT RLAGALTPLI EDDAALEHGL AVFGETFNNT WSGMLAAKLG LASLEHSDDD SLLSDLFETL QQVETDMTLF FRCLMNIPLN PISGNRATTF PAPENLESVD QMNDHGLVEL FRPAFYDAHQ AFSHAHLTRL AGWLRRYIAR VRQEGEPEGL RYHRMSRANP KYVLRNYLAQ QAIEALERGD DSVIIRLMEM LKHPYDEQPE HEDLAARRPE WARNKPGCSA LSCSS
|
| |