Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1734 |
Symbol | |
ID | 3786211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1982237 |
End bp | 1983508 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637811820 |
Product | hypothetical protein |
Protein accession | YP_412423 |
Protein GI | 82702857 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.996775 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAC CCTGGCTGGA AGGTATAGAG GTTGTCGATG TAAACACCCC CGTATCTGCG TGGGGCGGCT GGTTTGAAAC ATTCGTTATA ACTGCTGTTG TAATTGCGGC ATCATTCATT ACCCAGCAGG CGGATCCTTT CCGATTGAGT GGCGGATTCC CCTGGGCAGT GCTGGCACCC CTGCTCGCGG GATTACGGTA CGGGTTTGTC TTTGGTTTTG TCAGCGCGCT GCTGACACTC GCAGTCCTTG GCGTCGCCAT CGACCAGCAA TGGCAGGCCG CGAAGAGCTT TCCGCTGCCT TGGGCGATAG GCGTGGTGGT GGTGGCGATG GTGGCAGGGG AGTTTCGTGA CATGTGGGGG CGTCGCCTGC ACCGGCTGGA GGGCGCGTAT CAATACCGCG CCGAGCGTCT TGAGGAATTC ACGCGCAGTT ACCAGTTATT GCGCCTCTCG CATGACCGTC TCGAACAGAC CGTTGCCAAC AGTGGCTTTT CCCTGCGTGA AGGCATCATG CACCTTCAAT CCACGCTGGA CGCCATCGAT GGAATGACGG AAAGCTCGCT GCAAAAGCTT ATTGAATTTG TGGCGGAATA TGGTGCATTG ACTCAGGCCT GCATCATCGG GATTACTGCC GACCGCATTG ATACTTCCAA CGTTCTGGCC TGTGTGGGAG AGCGCTTTCC CATCGATGTG ACTGATCCAG TGCTGAGAAT GGCGCTCGAC AGCGGCGAAC TGGCCACGTT GAATCTTCTG CAGGAATCAG AGATGGACCA GGCCCAGCTT CTGGCTGTGG TACCTTTAAC CGACTCCGTC GGCGAAATAA ATGCCGTGCT GGCAGTGCGT TCCATGCCTT TTTTTTCGTT TCATGAAAGC AATCTCAAAC TTATCGCGGT GCTGGTGGCC CACGGCGTGG ACCATCTCCG CTTTGGAACT GCGAGGCCAT CGGTTCGTCG GTTTATTGCT TCATTTGAAC GGGCATATCA GGATTTTTCG CGCTTCAAGC TCGATACCGT GCTCCTGAGA TTATCCGGAA ATCCGGAGGA GGTGCGGAGC GTTCACGAAA AGCTGCGGTT TTCGATTCGT GCCATCGACT TTATCTGCCT TGCGCGTGAA AAGGATCAGT ATGTCGTCTG GGCGATGTTG CCACTGACGG ATATTACTGG GGCACGGGCA TGGGCGCAGC GAGTAGCCGA TATTCCCGCA ACAACCGCGC AGGAATGGAT GTCTATCAAT GAAATTGATC CGCAAAGGAT CCGTTCCCTG GAGCAGGGGT GA
|
Protein sequence | MKKPWLEGIE VVDVNTPVSA WGGWFETFVI TAVVIAASFI TQQADPFRLS GGFPWAVLAP LLAGLRYGFV FGFVSALLTL AVLGVAIDQQ WQAAKSFPLP WAIGVVVVAM VAGEFRDMWG RRLHRLEGAY QYRAERLEEF TRSYQLLRLS HDRLEQTVAN SGFSLREGIM HLQSTLDAID GMTESSLQKL IEFVAEYGAL TQACIIGITA DRIDTSNVLA CVGERFPIDV TDPVLRMALD SGELATLNLL QESEMDQAQL LAVVPLTDSV GEINAVLAVR SMPFFSFHES NLKLIAVLVA HGVDHLRFGT ARPSVRRFIA SFERAYQDFS RFKLDTVLLR LSGNPEEVRS VHEKLRFSIR AIDFICLARE KDQYVVWAML PLTDITGARA WAQRVADIPA TTAQEWMSIN EIDPQRIRSL EQG
|
| |