Gene Information |
Locus tag | Nmul_A1454 |
Symbol | |
ID | 3785545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1661588 |
End bp | 1662859 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637811542 |
Product | hypothetical protein |
Protein accession | YP_412149 |
Protein GI | 82702583 |
COG category | |
COG ID | |
TIGRFAM ID | |
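
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values listed, but the record does not state them. The function names are hypothetical and chosen for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon if present."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(1661588, 1662859)
print(length, protein_length_aa(length))  # prints "1272 423"
```

Under these assumptions the result matches the listed Gene Length (1272 bp) and Protein Length (423 aa).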
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.15372 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCTGAAA CAAACGTAGC AAGCGGAAGT TCACTGGCAG TCAAACATTA TAGCGCGGCG CTCTTCGCCA ACACGCTCAA AGGATCCACA GCGATTGACA GTCTTGTGGG CCCGGTCGAG CCTTCAGTAG CAATGCAGAA AATTGCCGGT CAAACAAATC CGGGCATGCC TATCGTGCGA ATCGATAATT TAATGAAAAG TGCTGGCGAT GTCGTATCGC TCGATCTGGT CGATACGGTG GGCGGTGAAC CATTGATGGG CGACGTCAAT CGTGAAGGAC GGGGCAGCGC GCTTTCGTTT TCCTCGATGG AGATCAAGAT CGATCTATCC AGTAAGGTTA TCGATGCCGG TGGCAGCATG TCGCAGCAAC GCACCAAGCA TCAGTTGCGG GAAATTGCCC TGGCGCAATT GTCAGGTTAT TTCCCCCGTC TCGACGCTCA GGAAACCCTG GTGCATCTTG CAGGAGCACG CGGTTCGCAA ACCGGTTCGG ACTGGACAGT ACCGCTTCAA AGCGCGCCGA ACTTCAGTTC CATCATGGTG AATCCCGTGA AGGCCCCTAC CTATAATCGT CATTTTGTAG TGAACGGCGC CAATCTGACT TCAGGGGGAC AGCAGTTGGG ATCCGTCGTT TCAACGGACG CCCTGCGCTT GTCGCATCTG GATCTGCTGC GGAAGAGGCT TGATGACATG GATCAGCCAT TGCAATCCGT CAAACTGGCA GGGGACCGAG CCGCTCAGAC TTCCAAGATG TGGGTATTTC TCGCCACACC CAATCAGTAT TCGCTCCTTT TGACCGAAGG TTCGTTACGT GCTTTCCAGC AGAATGCCAT CAATCGGGCG GCATATTTTG ACGAGCGCCA CCCCCTGTTT GCCGGTGAGG TTGGGATGTG GAATGGCATT CTGGTGATCA AGAATGAGCG TGCGATCCGT TTTATGCCTG GGGAAAGCAC AAAGATAGTC ACCGCAGCAA ACGCGACAAC TGCCACAGAA ACCGATCAGG CTGTCAATGG AGCGTTGACT GCCGGGTACG CGATCGAGCG CGGATTATTA TTAGGCGCAC AAGCACTGGG GGTCGCTTAT GGCAAAACCA GGGTCAGCGG AATGCAGTTC GGATGGAAGG AGCATTGGTA TAACTTTGAA AGTAACCTGG AAGTGATGGG TGAGAAGGTT TGCGGCAAAG CGAAAACCCG TTTCTCTATC GACGATGGAA CAGGTTTCAG GGTACCCACC GACTTTGGTG TGATTGCGGT TGACTCGGCT GTGCCGCTTT AA
Protein sequence | MAETNVASGS SLAVKHYSAA LFANTLKGST AIDSLVGPVE PSVAMQKIAG QTNPGMPIVR IDNLMKSAGD VVSLDLVDTV GGEPLMGDVN REGRGSALSF SSMEIKIDLS SKVIDAGGSM SQQRTKHQLR EIALAQLSGY FPRLDAQETL VHLAGARGSQ TGSDWTVPLQ SAPNFSSIMV NPVKAPTYNR HFVVNGANLT SGGQQLGSVV STDALRLSHL DLLRKRLDDM DQPLQSVKLA GDRAAQTSKM WVFLATPNQY SLLLTEGSLR AFQQNAINRA AYFDERHPLF AGEVGMWNGI LVIKNERAIR FMPGESTKIV TAANATTATE TDQAVNGALT AGYAIERGLL LGAQALGVAY GKTRVSGMQF GWKEHWYNFE SNLEVMGEKV CGKAKTRFSI DDGTGFRVPT DFGVIAVDSA VPL
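
The sequence fields can be checked against the GC content and protein sequence with a short script. This is a minimal sketch, not the database's own method: it assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TAA, so no reverse complement is applied even though the Strand field is "-"), and it uses the standard amino acid assignments, which translation table 11 shares with table 1 for elongation codons. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases of the gene sequence above.
print(translate("ATGGCTGAAA CAAACGTA"))  # prints "MAETNV"
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) is one way to confirm the two fields agree; `gc_content` on the same input can be compared with the listed GC content.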