Gene Nmul_A1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1004 
Symbol 
ID3785835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1164644 
End bp1165888 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content54% 
IMG OID637811088 
Productcytochrome b/b6-like 
Protein accessionYP_411699 
Protein GI82702133 
COG category[C] Energy production and conversion 
COG ID[COG1290] Cytochrome b subunit of the bc complex 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00661303 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGC GATTGGAGGC AATGATCGGC TGGATCGACG CCCGATTTCC ATTGACAGCC 
AACTGGAGAG CGCATCTCAG CGAGTACTAC GCGCCCAAGA ACTTCAATTT CTGGTACTAC
TTCGGTTCAC TTGCCATGCT CGTGCTGGTG AACCAGCTTC TCACCGGCAT TTTCCTCACG
ATGAACTACA AGCCGGATGC CAGCATGGCA TTCGCCTCGG TGGAATATAT CATGCGTGAC
GTGAATTACG GCTGGATTAT CCGTTACATG CATTCCACCG GCGCCTCGAT GTTCTTCGTG
GTCGTCTACC TGCACATGTT CCGCGGGATG ATGTATGGCT CCTATCGCAA ACCGCGCGAG
CTTCTCTGGG TCATCGGCAT GGGCATCTTT TTCGTCCTCA TGATGTTGGC ATTCACCGGC
TATATCCTCC CCTGGGGGCA GATGTCATAC TGGGGCGCGC AGGTGATCAT CAGCATGATC
GGCGCGATCC CGGTGATCGG CCAGACGCTG TCGAACTGGA TTCTGGGCGA CTTCATGCTC
TCGGATGCGG CGCTCAACCG CTTCTTTGCC TATCACGTGG TGACCCTGCC CGGCCTGCTG
GTAGTGCTGG TGATCGTCCA TATTCTGGCG CTGCATGAAG TAGGTTCGAA CAATCCGGAC
GGGATCGAGA TCAAGGCCAA CAAGGACCCC GTGACGCATA TCCCCGTGGA TGGCATTCCC
TTCCACCCCT ATTATTCGGT AAAGGATATT TTCGGCGTCG TCGTGTTCCT GATCGTGTTT
ACCGGCATCA TATTCTTCAT GCCGGAAATG GGCGGATACT TCCTCGAGTA CAATAATTTC
ATTCCTGCCA ACACGCTGCA GACTCCCGAC CACATCGCCC CAGTATGGTA TTTCACGCCA
TACTATTCGA TGCTGAGGGC GGTGACAGTG AACTTTCTCT GGATAGATGC CAAGCTCTGG
GGCATCGTGC TGATGGGCGG CTCCGTCGCA ATCTTTGCCT TGCTGCCATG GCTGGATCGC
AGTCCGGTAA AATCCATCCG GTACAAGGGG CCCATTTTCA AGTTTGCCCT TACACTTTTT
GTCATCAGCG TCCTTGTGCT TGGCTGGCTG GGCACAAAAT CGCCCACACC CCTCTACACG
TTGCTCGCGC AGATATTCAC GGTAATCTAT TTTGCTTTTT TCATTCTCAT GCCGTGGTAC
AGCAAGATCG ATAAAACCAA GCCTGAACCG ACCAGGGTGA GATAA
 
Protein sequence
MSKRLEAMIG WIDARFPLTA NWRAHLSEYY APKNFNFWYY FGSLAMLVLV NQLLTGIFLT 
MNYKPDASMA FASVEYIMRD VNYGWIIRYM HSTGASMFFV VVYLHMFRGM MYGSYRKPRE
LLWVIGMGIF FVLMMLAFTG YILPWGQMSY WGAQVIISMI GAIPVIGQTL SNWILGDFML
SDAALNRFFA YHVVTLPGLL VVLVIVHILA LHEVGSNNPD GIEIKANKDP VTHIPVDGIP
FHPYYSVKDI FGVVVFLIVF TGIIFFMPEM GGYFLEYNNF IPANTLQTPD HIAPVWYFTP
YYSMLRAVTV NFLWIDAKLW GIVLMGGSVA IFALLPWLDR SPVKSIRYKG PIFKFALTLF
VISVLVLGWL GTKSPTPLYT LLAQIFTVIY FAFFILMPWY SKIDKTKPEP TRVR