Gene Nmul_A2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2666 
Symbol 
ID3785673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp3058964 
End bp3060451 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content55% 
IMG OID637812756 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_413345 
Protein GI82703779 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000219403 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACTT ATACTTATAC CCCGGTTCCC GATCAGGGAG CCAAGGCGGG CGTGCTGGCC 
TATCTGGTCA TCTCCTGTGT AGTATTGCTA CTGATGATGC TTTTCGGCCT GCTCATGAGG
ATGGAGCAGG CGCAAATGAT CTCCATCGGA GCACTCTGGT TTTATAAAAT CATGACGTTG
CACGGTGCCG GAATGGTGGG TATCGTCGCC CTGGCCGGAG CGGCCATAAT GTGGCATTTT
CTGCGCCAGT ATGTCGATCT GTCGACAGGT ATATTCATGG CCAACCTGAT CCTGTTTCTG
ACAGGGGTCG TCATGATCCT GGCGAGTGTC CTGCTTGGCG ATTTCCACGC CGCCTGGACC
TTCCTCTTTC CGTTGCCCAC CACTTCCATG GGCATGTGGA GTCCGGGAGC TGCCGCGCTG
TTCATGGGCG GATTGCTGGT CATCGGCGTC GGTTTCGTGC TGCTGCATCT GGATATCGCG
CGCGCCATCA TCAGCCGCTA CGGCGGTCTG GGTCGCGGAC TTGGCTGGCC ACAGCTTTTC
GGTCCTGATG ACGGCAACGC ACCGCCGCCG GCCGTGGTTG CCAGCACCAT GGTCACCATC
GTCAACCTGA TTGGCCTAGT GGTCGGCGCC AGCATCCTGG TTATGATGCT GATCAATGTT
TACGTCCCGA CTTTCGAAAT CGATCCGCTG CTGGCCAAGG GCATGATCTA CTTCTTCGGA
CACGTATTCA TCAATGCCGT TATCTACATG GCTGTGATCG CGGTCTATGA AATCCTGCCG
CGCTATACGC AACGTCCCTG GAAGGCGAAC AAGGTGTTTC TTGCCTCCTG GACCGCTTCC
ACGATCATGG TCATGTTCAT CTTCCCGCAC CACCTGTTGA TGGATTATGC CTATCCCAAG
TGGTTCCTGA TCATGGGTCA CATCATCGGT TATCTCAATA CCTTCCCGAT CCTGATCGTG
ACGGGTTATG GCGCCATGAT GATCGTGTAC CGGTCGGGTA TTCGCTGGGA TATGTGCTCA
CGGCTGCTGT TCGTGTCGCT CTTCGGTTGG GCAGTCGGCG CGATGCCCGC GTTCATCGAC
GGCACCATCA CGGTCAACTA TGTCATGCAC AACACGTTGT GGGTGCCGGG ACATTTCCAT
ACCTATCTGT TGCTCGGCAT GGTTGCGATG GTCTTCGGGT TCATGTATTA CCTTGGCAAG
CCGAACGAGA ATGCGCCGGA TAGCGCCCTT GACGTCGCTG CTTTCTGGGG ATTCGTTATC
GGCACCATGG GTTTCACCAT GAGCTTTCTC TATTCCGGGA AAATCAGCGC CGCGCGCCGC
TATGCGGAAC ATCTTCCGGA ATGGGTGCCT TACGACAAAA TTGCTTCATA TTTCGCAATG
TTGTTGATCG CTTCTGTGCT AGTATTCATT TTCCGCTTTC TTACGCGGTT GGGGCTGGCG
AGTCGCGATT ATCAACGCGC CTCTCTTGCG CGAAGTATGG CTACATGA
 
Protein sequence
MATYTYTPVP DQGAKAGVLA YLVISCVVLL LMMLFGLLMR MEQAQMISIG ALWFYKIMTL 
HGAGMVGIVA LAGAAIMWHF LRQYVDLSTG IFMANLILFL TGVVMILASV LLGDFHAAWT
FLFPLPTTSM GMWSPGAAAL FMGGLLVIGV GFVLLHLDIA RAIISRYGGL GRGLGWPQLF
GPDDGNAPPP AVVASTMVTI VNLIGLVVGA SILVMMLINV YVPTFEIDPL LAKGMIYFFG
HVFINAVIYM AVIAVYEILP RYTQRPWKAN KVFLASWTAS TIMVMFIFPH HLLMDYAYPK
WFLIMGHIIG YLNTFPILIV TGYGAMMIVY RSGIRWDMCS RLLFVSLFGW AVGAMPAFID
GTITVNYVMH NTLWVPGHFH TYLLLGMVAM VFGFMYYLGK PNENAPDSAL DVAAFWGFVI
GTMGFTMSFL YSGKISAARR YAEHLPEWVP YDKIASYFAM LLIASVLVFI FRFLTRLGLA
SRDYQRASLA RSMAT