Gene Noc_3074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_3074 
Symbol 
ID3705652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3466809 
End bp3468185 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content49% 
IMG OID637739548 
ProductF0F1 ATP synthase subunit beta 
Protein accessionYP_345045 
Protein GI77166520 
COG category[C] Energy production and conversion 
COG ID[COG0055] F0F1-type ATP synthase, beta subunit 
TIGRFAM ID[TIGR01039] ATP synthase, F1 beta subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCG GAAAGACTGT GCAGATTATT GGCGCAGTAG TTGATGTGGA ATTTCCACGT 
GAATCCATTC CAAAGGTGTA CCATGCCTTG AGAATTGATG ACGTTGGTTT GACTCTGGAA
GTACAGCAGC AGCTTGGGGA TGGGGTCGTG CGTACAATTG CAATGGGAGG TTCCGATGGT
TTACGCCGGG GCATGGCGGT AACCAATACC GGGGCTCCTA TTTCAGTGCC AGTAGGTACT
AAAACGTTAG GGCGAATCAT GGATGTCCTA GGGGAGCCCG TGGACGAAGC GGGGCCGGTT
GGTGAAGAAG AGCGCTGGTC TATTCATCGT AAAGCGCCTG CCTATGAGGA ATTGTCCCCC
GCTACGGAGT TGCTGGAAAC CGGTATTAAG GTCATTGACT TAATATGCCC CTTTGCCAAA
GGGGGTAAGG TTGGCCTATT TGGGGGCGCT GGTGTGGGTA AGACCGTTAA TATGATGGAG
CTTATCCGTA ACATTGCAAC CGAGCATTCC GGCTACTCAG TGTTTGCGGG TGTTGGGGAA
CGGACTCGCG AAGGCAATGA TTTTTATCAT GAGATGAAAG ATTCCCAGGT TTTGGATAAG
GTTTCTCTGG TTTATGGGCA GATGAACGAA CCTCCTGGAA ATCGCCTGCG GGTAGCTTTA
ACGGGACTGA CCATGGCTGA ATTCTTTCGT GAAGAAGGTC GTGACGTATT GCTGTTTGTG
GATAATATCT ACCGCTATAC GTTGGCAGGT ACAGAGGTTT CGGCGCTGCT TGGCCGGATG
CCCTCCGCGG TAGGCTATCA GCCTACTTTG GCGGAGGAGA TGGGTGTTTT ACAGGAACGT
ATTACTTCCA CTAAGACGGG TTCTATTACC TCGATTCAAG CAGTATACGT CCCTGCGGAT
GACCTTACTG ATCCCTCTCC AGCTACGACT TTTGCCCATT TGGACGCTAC AGTAGTGTTA
TCCCGCCAGA TCGCAGAGCT TGGGATTTAC CCTGCTGTGG ATCCTCTTGA TTCAACCAGT
CGACAGCTTG ATCCTCTCAT TGTGGGGCAG GAACATTACC AAGTAGCACG GGCGGTGCAA
GGTAATTTGC AACGGTACAA GGAGCTTAAG GATATCATTG CTATCCTTGG TATGGATGAG
TTGTCTGAGG AAGATAAACT TACCGTGTCA CGGGCGCGGA AAATTCAGCG TTTTCTCTCT
CAGCCTTTTT TTGTGGCTGA AGTCTTTACA GGTAGTCCTG GCAAATATGT GCCTCTTAAA
GAAACCATTC AAAGCTTCAA AGGTATTGTT GAAGGTGAAT ATGATCATCT ACCAGAGCAG
GCCTTTTACA TGGTTGGCAC TATTGATGAA GCAGTAGAAA AGGCTAAGAA GCTTTAA
 
Protein sequence
MSTGKTVQII GAVVDVEFPR ESIPKVYHAL RIDDVGLTLE VQQQLGDGVV RTIAMGGSDG 
LRRGMAVTNT GAPISVPVGT KTLGRIMDVL GEPVDEAGPV GEEERWSIHR KAPAYEELSP
ATELLETGIK VIDLICPFAK GGKVGLFGGA GVGKTVNMME LIRNIATEHS GYSVFAGVGE
RTREGNDFYH EMKDSQVLDK VSLVYGQMNE PPGNRLRVAL TGLTMAEFFR EEGRDVLLFV
DNIYRYTLAG TEVSALLGRM PSAVGYQPTL AEEMGVLQER ITSTKTGSIT SIQAVYVPAD
DLTDPSPATT FAHLDATVVL SRQIAELGIY PAVDPLDSTS RQLDPLIVGQ EHYQVARAVQ
GNLQRYKELK DIIAILGMDE LSEEDKLTVS RARKIQRFLS QPFFVAEVFT GSPGKYVPLK
ETIQSFKGIV EGEYDHLPEQ AFYMVGTIDE AVEKAKKL