Gene Nmul_A1654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1654 
Symbol 
ID3785596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1892977 
End bp1894512 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content57% 
IMG OID637811740 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_412344 
Protein GI82702778 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit
[TIGR03324] alternate F1F0 ATPase, F1 subunit alpha 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATCAAG ATCGATTGCG AGACGTTTTT GATGGCGCTT TTTCCCGGAT TGCTCTCGCG 
CGGAAAATCG TGACTCCCCG ACTCATGCTG AAGGAGATCG GGGTAATTAA GAGCATTGGT
ACTGGCATAG CGAAGGTTTC CGGCCTTCCC GGCACGTGCT TCGAGGAGGT GCTCAAGTTT
CCCGGCGGCT ATGGCATTGC ATTTAATATC GAAGAAGAGG AAATCGGGGT GATCCTGCTG
GGTGATCATT CTCACCTGCG TGCAGGAGAT GAAGTGGAGC GCAGGGGACA CGTAATGGAT
GTGCCGGTGG GCGACGCGTT GATCGGAAGG ATAGTCAATC CGCTGGGCCG GGCGCTCGAT
GGCGAAGCTC CGGTAATATC CTCGCGCCGT ATGCCAATCG AACGTCCCGC TCCCCAGATT
ATGGATCGGG CTCCTGTCAC AGTGCCGCTT CAGACGGGCA TCAAGGTCAT CGATGCACTG
ATTCCTGTAG GGCGGGGGCA GCGCGAACTG ATACTGGGCG ACCGGCAGAC GGGCAAGAGC
GCTATAGCAC TCGACACCAT ACTGAATCAG AAAGACGAGA ACGTGGTGTG CATCTACTGC
GCCATCGGGC AGCAAGCATC GAGTGTTGCC AAGGTAGTGG CTGCGCTGCA GGAAAATGAC
GCTCTGAGCT ATACAGTTGT GGTAGTAACC GAAGGCAATG ACCCGCCGGG GCTGATCTAT
GTTGCCCCCT ATGCCGCGAC TGCTATCGGA GAATACTTCA TGGAGCAGGG CCGGGACGTT
TTGATCGTTT ACGATGACCT TTCGCACCAC GCTCGCGCCT ACCGTGAACT TTCGCTGCTG
ATGCGGCGCC CCCCTGGACG CGAAGCCTAT CCCGGAGACA TCTTTTACAT CCACTCACGC
TTGCTGGAGC GCGCTACGCA TCTGCGCCCC GAACTCGGCG GCGGTTCCCT GACCGCCTTG
CCGATCGTAG AAACAGAAGC GGAGGACATC GCAGCATATA TTCCAACCAA TCTGATCTCG
ATTACCGACG GGCAGATTTA CCTTTCACCC ACATTGTTTC AACTTGGAAT ACTGCCCGCC
ATCGACGTTG GCAAATCGGT TTCGCGGGTG GGAGGCAAGG CGCAGCGGCC TGTGTACCGG
GCCGCTACCG GCGAGCTGAG GCTGGATTAT TCCCAGTTCA GCGAACTGGA AACATTTACA
CGCTTCGGGG GACGCCTCGA TGAGCGTACC CGTACCGTTA TCGAGCATGG CCGGCGTATT
CGTGCCTGTT TGCAGCAGCC AGAGTCCAGC CCGGTTTCCG TCTCAGAGCA GATCATTCTG
CTGCTGGCAT TGACCGCAAA ACTATTCGAC GAGGTACCGC TGGAAAACAT GGGGGAGGCG
GAGCGGGCGG TGAGGGCAGT AGTGTTGAAT ATACCTCCCG AACTGCTTGC ACGTATGGAA
GCCAACGAAA GCCTGAACGA TGCAGATCGC CATGCACTTT TGTGGCATGC AAGCACTGCG
CTGGATGGGT TAGGCTCAGC ACATGCGAAA GCCTAA
 
Protein sequence
MNQDRLRDVF DGAFSRIALA RKIVTPRLML KEIGVIKSIG TGIAKVSGLP GTCFEEVLKF 
PGGYGIAFNI EEEEIGVILL GDHSHLRAGD EVERRGHVMD VPVGDALIGR IVNPLGRALD
GEAPVISSRR MPIERPAPQI MDRAPVTVPL QTGIKVIDAL IPVGRGQREL ILGDRQTGKS
AIALDTILNQ KDENVVCIYC AIGQQASSVA KVVAALQEND ALSYTVVVVT EGNDPPGLIY
VAPYAATAIG EYFMEQGRDV LIVYDDLSHH ARAYRELSLL MRRPPGREAY PGDIFYIHSR
LLERATHLRP ELGGGSLTAL PIVETEAEDI AAYIPTNLIS ITDGQIYLSP TLFQLGILPA
IDVGKSVSRV GGKAQRPVYR AATGELRLDY SQFSELETFT RFGGRLDERT RTVIEHGRRI
RACLQQPESS PVSVSEQIIL LLALTAKLFD EVPLENMGEA ERAVRAVVLN IPPELLARME
ANESLNDADR HALLWHASTA LDGLGSAHAK A