Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0438 |
Symbol | |
ID | 3785906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 486326 |
End bp | 487654 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637810514 |
Product | hypothetical protein |
Protein accession | YP_411138 |
Protein GI | 82701572 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTGCA GGCGAAAGCG CAGCAGCCGG ATTCCGTACG GAGTAGGTTG TTTTTTATTG ATCTTCATGA GCAACAGCAT TGTTCAGGCG GTTGACATAG CAGGGTGGTT GCGGGAGAAC GGTGTAAGGG CGGGCGGGTG GATCAATGCG GGGGCAACCT ATAACCCCGG TAATCCCGTC AACGGATACA ATGGCCCCGT TACCTTTGCC GACCGCGCAA ACCGGTTTCA GTTGAATCAG TTCAATCTAT ATCTCCAGCG CACGGTAGTC ACGGAAGGCG AGGCTTTTGA TCTCGGGGGG CGGGTCGATT TCATGTTCGG CACCGATGCG ATCTATACCC AGGCCTTCGG CGTTCCGCCA TTCGATGTGA ATACCGGTCA AGTCTTGAAT AGGGGCCACT GGGATCTGAA TATATGCTGC TCATCGACCC GGACGTATGG TATCGCGCTA CCCCAAGCGT ATCTCGAGGC CTATATACCG TTGGGTAATG GCTTGAACAT CAAGGCAGGT CATTTTTATT CGCCGACAGG GTATGAAACC GTTCCTGCGC CCGACAATTT TTTTTACACT CACGCCTATA CCTTTAACAA TGGGGAGCCA TTCACCCACA CGGGTGCCGT AGGGAACTAT ACCGTCAACG GGAACTGGTC GGTGATGGGC GGCCTCATCA CCGGGAGCGC AACCGGCGAT TGGGATGGCG GATTCGACAG AGAGCTGGGA AACTGGGGAG GGCTGGGCGG TATTACCTGG GTCAGCGACG ACAAAAGGAC TTCGGCCAAC ATCACGGGGA GCTACAGTGC GACGTCCACC CGCAGCAACA GGCCATGGGG AATGTACAAC ATTGTGCTGC AACACAGGAT CACACCGAAA ACCCATCTGG TTCTACACCA CGTTCACGGC TATGCCGGCG GCGTTTTGCT GGGAGGGGTG CCAAAGAATG TCGAATGGTA TGGCATCAAT ACCCACCTCT ATTATGACTT GTTGGAGGAT CTCTCGGTGG GTATTCGCGG CGAATGGTTC CGGGACCGGG ATGGTTTTCG CGTCTCTTCG CCCTTTCGCG TGGTAGCGGC GCTCAACCAT ACCGGAATCA GTTTTGCAGG CGATTCTTCC ACAGTGAGGG CGGCCCCTGC GGACTACTAT GAAATCACCT TCGGAATGAA CTGGAAGCCA GCGAAAAGGT TGCGGCTGGA CTGGAAGCCG ATGCAAAAGC TGAACGTTCG TCCAAATATC CGCTATGACC GTGCGGACGG CATTGACCAT TCGCATCGGC CTTTCAACAA CAAGAAGGAT CAGATTATAT TTTCTCTCGA TGCGATGATT CCATTCTGA
|
Protein sequence | MKCRRKRSSR IPYGVGCFLL IFMSNSIVQA VDIAGWLREN GVRAGGWINA GATYNPGNPV NGYNGPVTFA DRANRFQLNQ FNLYLQRTVV TEGEAFDLGG RVDFMFGTDA IYTQAFGVPP FDVNTGQVLN RGHWDLNICC SSTRTYGIAL PQAYLEAYIP LGNGLNIKAG HFYSPTGYET VPAPDNFFYT HAYTFNNGEP FTHTGAVGNY TVNGNWSVMG GLITGSATGD WDGGFDRELG NWGGLGGITW VSDDKRTSAN ITGSYSATST RSNRPWGMYN IVLQHRITPK THLVLHHVHG YAGGVLLGGV PKNVEWYGIN THLYYDLLED LSVGIRGEWF RDRDGFRVSS PFRVVAALNH TGISFAGDSS TVRAAPADYY EITFGMNWKP AKRLRLDWKP MQKLNVRPNI RYDRADGIDH SHRPFNNKKD QIIFSLDAMI PF
|
| |