Gene Noc_2087 details

Gene Information

Locus tag: Noc_2087
Symbol:
ID: 3704947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2396096
End bp: 2397982
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 53%
IMG OID: 637738562
Product: V-type ATPase, 116 kDa subunit
Protein accession: YP_344077
Protein GI: 77165552
COG category: [C] Energy production and conversion
COG ID: [COG1269] Archaeal/vacuolar-type H+-ATPase subunit I
TIGRFAM ID:
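
The coordinate and length fields above are consistent with each other, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2396096
end_bp = 2397982

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1887 628
```

Both results match the Gene Length (1887 bp) and Protein Length (628 aa) fields.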


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.781263
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGACAT CCTTATCAAT GCGGCGAGTT ACCCTTCATA TGGTCAAGGA AGAGGCTCCG 
GAGGCTGCCC TTAGCTTAGC GGAAAGCGGG GTGTTTAGTC CGGAACCCCT TCCGGCCGGG
GAGGAAAGTT TGCCGGAGTG TCCCGCCGCC AGCTACCAGG CGTTATTTCA TGATGTCCAG
ATGCGTTTGC AAAAGGTGAG CGAATACCTG GGGGTCCAGG CTGCCATCCC GCCAGAAAAG
GTGCGGGCCA TCAGCGAGTC AGAGTTGGCG GCGTTTAATA AGTGGTTGGG GAGGCTCTGG
CGGCGTTGTT CTCAGTTGCA AGAAAAAGGG CGGGAACTGG ATGAGAAGAT CCGATCCATT
GGCCAATTAG ACAAGACCTT GGATACCTAT GCCCGATTGG ACGTGAATTT AGGATTATTA
CAGGGGCAAT TGCAATTTTT AGATGTTCAG CTTGGAGCGG TCCCGCAAAG TAATTTCGGC
AAACTCCAGG AGGCGGTAGG CATGGCGGGC TATATTCTTA AACCGTTTGC TGAATCTGAT
TCCGCGGCCT TGGTTGTGCT TGCCGGGGTG AAAGGAAATG AACGGCAAGT GCAAAGCGTG
CTTCGGGCGG CCGCCTACCG TCCACTTCAA CTGCCCGCCG AATTTCAGCA TCATCCCCAG
CAGGTTCGCC AACAGCTCGC GGCTCAGCGC CTGCGTTTCC AAGAGGAGCG CCTGGCGCTG
GCTCAGGAGC GGGAGTTGCT TCATAAAAAG CATGAACAGG CGTTGCATAA GGGAGCCCAG
AGGATGATTC TAGCGGCTCC CTATGCTTTT CTAGGCGCTT CTCTACGGAG CCAGGGCGGG
CTGGCGACGG TACAAGGCTG GGTACCAACG GAAAAAATAA GTTGTTTGCG GAAGACCTTG
CAAAGACGAC TAGAGCAGCG TTTTGTGCTA GAAACGCGAG ATCCGACCTT AGATGAGCGG
TTGATGACGC CTTCCCTAAT CCGGGTTCCT CGCTGGCTCC AGCCTTTTAC CGATGTAGCG
CATAATTATG GTGTGCCCCG TTATGGTGAA TTAGATCCTT CCTGGCTATT TGCTCTGACC
TTCATTGCCA TGTTCGGCAT GATGTTCGGC GATGTGGGTC ATGGCGCAGC TATCCTGGCG
GTTGGCTGGG GGTGCCGCCG TTGGTTGGGG CACTATACAC CTTTCGTACT CTCCCTGGGG
GTTTCTTCTT GCTGCTTTGG CTTTCTATAT GGCAGCGTTT TTGGTTATGA AGAGATCCTT
TCTCCCTTAT GGCTATCGCC CTTGTCCAAT CCCCAACGGA TGCTTATGGT CGCCCTTTTT
TGGGGAGTGG GGTTTATTCT TCTAGTGACG GCTATGGGGA TTCGCAACCG TTTAGCGGAG
GGGTGTTATG CCGAGGCTCT ATTAGGAGGG CGGGGACTGG CGGGGAGTTT ATTATATTTA
GGCGCCGTAT ATGGTGCGTT CCGCTGGATG GAAGAGGGCG TATTCGGGGC GATGGAGGCA
GCGGCTATTG TCTTGCCGTT TCTGGTTCTG CTGGGGTATC AATGGCATCG GTCCCAGGTT
CCCGGCTTTG GACGGGGGGT GGTCGTGCTT ATCGAGAGTT TTGAAATAAT CATGGGTTAC
TTTGCCAATA CCCTTTCTTT TCTTAGGGTC GCCGCTTTTA GCTTGAATCA TGTGGCGCTG
GCGTTGGCTG TATTTGCCCT GGCTGGGACA ATGGAGGCGG TTGGCCATTG GGTCACGGTA
GTAGTAGGCA ATCTTTTTAT TTTGATTTTA GAAGGGGCCA TTGTGGCCAT TCAGGTATTA
CGGCTGGAAT ATTACGAGGG GTTTTCCCGC TTCTTTAGTG GCGATGGTCG GGCCTTTGAG
CCGTTGATAT TAGGGCCTCT AAAATAA
 
Protein sequence
MLTSLSMRRV TLHMVKEEAP EAALSLAESG VFSPEPLPAG EESLPECPAA SYQALFHDVQ 
MRLQKVSEYL GVQAAIPPEK VRAISESELA AFNKWLGRLW RRCSQLQEKG RELDEKIRSI
GQLDKTLDTY ARLDVNLGLL QGQLQFLDVQ LGAVPQSNFG KLQEAVGMAG YILKPFAESD
SAALVVLAGV KGNERQVQSV LRAAAYRPLQ LPAEFQHHPQ QVRQQLAAQR LRFQEERLAL
AQERELLHKK HEQALHKGAQ RMILAAPYAF LGASLRSQGG LATVQGWVPT EKISCLRKTL
QRRLEQRFVL ETRDPTLDER LMTPSLIRVP RWLQPFTDVA HNYGVPRYGE LDPSWLFALT
FIAMFGMMFG DVGHGAAILA VGWGCRRWLG HYTPFVLSLG VSSCCFGFLY GSVFGYEEIL
SPLWLSPLSN PQRMLMVALF WGVGFILLVT AMGIRNRLAE GCYAEALLGG RGLAGSLLYL
GAVYGAFRWM EEGVFGAMEA AAIVLPFLVL LGYQWHRSQV PGFGRGVVVL IESFEIIMGY
FANTLSFLRV AAFSLNHVAL ALAVFALAGT MEAVGHWVTV VVGNLFILIL EGAIVAIQVL
RLEYYEGFSR FFSGDGRAFE PLILGPLK
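
The sequences above are displayed in blocks of ten residues, so they need to be joined before any analysis. The following is a rough sketch of one way to do that and to run two basic checks on the gene sequence; the helper names (`join_blocks`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the start codon list is an assumption covering only the common bacterial start codons rather than every initiator allowed by translation table 11.

```python
def join_blocks(blocked: str) -> str:
    """Remove the spaces and line breaks used for the 10-residue display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_cds(
    dna: str,
    starts=("ATG", "GTG", "TTG"),   # assumption: common bacterial start codons only
    stops=("TAA", "TAG", "TGA"),
) -> bool:
    """Check length is a multiple of 3 and the ends are start/stop codons."""
    return len(dna) % 3 == 0 and dna[:3] in starts and dna[-3:] in stops


# First and last lines of the gene sequence above, used as a short demonstration.
first_line = "ATGTTGACAT CCTTATCAAT GCGGCGAGTT ACCCTTCATA TGGTCAAGGA AGAGGCTCCG"
last_line = "CCGTTGATAT TAGGGCCTCT AAAATAA"

fragment = join_blocks(first_line + "\n" + last_line)
print(len(fragment), looks_like_cds(fragment))  # 87 True
```

Passing the full gene sequence to `join_blocks` and then to `gc_fraction` gives a value that can be compared with the GC content field in the record.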