Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1004 |
Symbol | |
ID | 3785835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1164644 |
End bp | 1165888 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637811088 |
Product | cytochrome b/b6-like |
Protein accession | YP_411699 |
Protein GI | 82702133 |
COG category | [C] Energy production and conversion |
COG ID | [COG1290] Cytochrome b subunit of the bc complex |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00661303 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGC GATTGGAGGC AATGATCGGC TGGATCGACG CCCGATTTCC ATTGACAGCC AACTGGAGAG CGCATCTCAG CGAGTACTAC GCGCCCAAGA ACTTCAATTT CTGGTACTAC TTCGGTTCAC TTGCCATGCT CGTGCTGGTG AACCAGCTTC TCACCGGCAT TTTCCTCACG ATGAACTACA AGCCGGATGC CAGCATGGCA TTCGCCTCGG TGGAATATAT CATGCGTGAC GTGAATTACG GCTGGATTAT CCGTTACATG CATTCCACCG GCGCCTCGAT GTTCTTCGTG GTCGTCTACC TGCACATGTT CCGCGGGATG ATGTATGGCT CCTATCGCAA ACCGCGCGAG CTTCTCTGGG TCATCGGCAT GGGCATCTTT TTCGTCCTCA TGATGTTGGC ATTCACCGGC TATATCCTCC CCTGGGGGCA GATGTCATAC TGGGGCGCGC AGGTGATCAT CAGCATGATC GGCGCGATCC CGGTGATCGG CCAGACGCTG TCGAACTGGA TTCTGGGCGA CTTCATGCTC TCGGATGCGG CGCTCAACCG CTTCTTTGCC TATCACGTGG TGACCCTGCC CGGCCTGCTG GTAGTGCTGG TGATCGTCCA TATTCTGGCG CTGCATGAAG TAGGTTCGAA CAATCCGGAC GGGATCGAGA TCAAGGCCAA CAAGGACCCC GTGACGCATA TCCCCGTGGA TGGCATTCCC TTCCACCCCT ATTATTCGGT AAAGGATATT TTCGGCGTCG TCGTGTTCCT GATCGTGTTT ACCGGCATCA TATTCTTCAT GCCGGAAATG GGCGGATACT TCCTCGAGTA CAATAATTTC ATTCCTGCCA ACACGCTGCA GACTCCCGAC CACATCGCCC CAGTATGGTA TTTCACGCCA TACTATTCGA TGCTGAGGGC GGTGACAGTG AACTTTCTCT GGATAGATGC CAAGCTCTGG GGCATCGTGC TGATGGGCGG CTCCGTCGCA ATCTTTGCCT TGCTGCCATG GCTGGATCGC AGTCCGGTAA AATCCATCCG GTACAAGGGG CCCATTTTCA AGTTTGCCCT TACACTTTTT GTCATCAGCG TCCTTGTGCT TGGCTGGCTG GGCACAAAAT CGCCCACACC CCTCTACACG TTGCTCGCGC AGATATTCAC GGTAATCTAT TTTGCTTTTT TCATTCTCAT GCCGTGGTAC AGCAAGATCG ATAAAACCAA GCCTGAACCG ACCAGGGTGA GATAA
|
Protein sequence | MSKRLEAMIG WIDARFPLTA NWRAHLSEYY APKNFNFWYY FGSLAMLVLV NQLLTGIFLT MNYKPDASMA FASVEYIMRD VNYGWIIRYM HSTGASMFFV VVYLHMFRGM MYGSYRKPRE LLWVIGMGIF FVLMMLAFTG YILPWGQMSY WGAQVIISMI GAIPVIGQTL SNWILGDFML SDAALNRFFA YHVVTLPGLL VVLVIVHILA LHEVGSNNPD GIEIKANKDP VTHIPVDGIP FHPYYSVKDI FGVVVFLIVF TGIIFFMPEM GGYFLEYNNF IPANTLQTPD HIAPVWYFTP YYSMLRAVTV NFLWIDAKLW GIVLMGGSVA IFALLPWLDR SPVKSIRYKG PIFKFALTLF VISVLVLGWL GTKSPTPLYT LLAQIFTVIY FAFFILMPWY SKIDKTKPEP TRVR
|
| |