Gene Noc_2088 details

Gene Information

Locus tag: Noc_2088
Symbol:
ID: 3704948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2398025
End bp: 2399152
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 51%
IMG OID: 637738563
Product: H+-transporting two-sector ATPase, C (AC39) subunit
Protein accession: YP_344078
Protein GI: 77165553
COG category: [C] Energy production and conversion
COG ID: [COG1527] Archaeal/vacuolar-type H+-ATPase subunit C
TIGRFAM ID:
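
The coordinate and length fields above agree with one another, as a short check can illustrate. This is a minimal sketch that assumes the start and end coordinates are inclusive and that the CDS ends in a single stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature whose start and end coordinates are inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2398025, 2399152)
print(length_bp, protein_length(length_bp))  # 1128 375
```

Under these assumptions the computed values match the Gene Length (1128 bp) and Protein Length (375 aa) fields.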


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.295713
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATCGTC CTTTTTTAAC TATCACCAAG AGCCCTTCCA TAGCCACCGT TACCCGCTAT 
GCTTATTTGA ATACCTTGGT CTCGGCTCTC TCCAAGCGTC TCCTGTCAGC AGAACAGCTC
CGGAATCTGG TGGACCAATC GGCCTCGGAT GTTTCGGTTT TACTTCGTAC CGCCGGGTTG
ACGGGGATTT CTCTGCAAGC AATGGAGGAT CGTTCTTTAG AGCAAGTGCT GGTGGATACC
TTATGGGCGG AAGCCCAACG GCTGATTCGT CCTCTGAGCG CTGAAGCCCA GGAACTCGCA
AGCTATTGGC TACGCCGTTT TGAGATAGGT AACCTGAAAA TTGTGCTCCG GGGTAAGTTG
ACTGGACGTC CAAAGGAGGC CATCCAAGCC GATCTTATTA AGATTAATGG AATAGCGGCC
TTGCCCTTGA ACTCGTTGAT GGAGGCCGAG GATGCTCAGG AGGTGCTCCA TTGTTTAGAG
GGAACACCCT ATGCTGGTGT TGCCCGCCAG GCCCGTTCGG TCTATGGAGG AGAGCATTAT
GATGTGCCTC ATGTCGGAGG AGAGGGTCGA GAATTGTTTC TTATCGAAGC CACAATAGAT
AGGGAATACT ATAGTGGATT GGCGCGGCGA GTAAATGCTA TCCAGGAAGA ACGGGACCGC
CATTATTTGC GAATATTGAT CGGTTATCTC ATCGACCAAA CCAACTTGGT CTGGTTATTG
CGCTACCGGT TTGCTTACCG TCTTGGGCCA CCGCTGACCT ATTTTCTTTT ATGCCCCGGT
GGGTATCATT TGCGTAGTCA GCACTTGCTG GCCCTCGCGC GTGGGGAAAA CTTCGAGGAG
ACTTTACACA ATCTACCTGC TCCTTTAGCC CCTTTGGTAG CAGGGGCAGT TTCCACTTCC
GAGGTGGAGG AGATGCTAGG AAAACACTTA CTCGAAGTCG CGCAGTTTAT CCTGAAACGC
ACCACTTTTA ATTTGGGACG GGCCTTCGCT TATCTATTTC TTCGTGAAAA AGAATTATTA
CGCATCCATG GGATCATCAA GGGCCGCACT TTGCAGCTCG CCCCCCATCT TATTCACCAG
GCCATGGGAC TGGAAAGTGC CGGTTTCAAG GAGGATGAGT CTTTTTAG
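
A GC content figure like the one in the Gene Information fields can be recomputed from a sequence printed in space-separated blocks of ten bases. The sketch below is one way to do that, assuming GC content means the percentage of G and C among the A, C, G and T characters. How the database itself rounds the value is not stated, and `gc_percent` is an illustrative name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the nucleotides of a sequence.

    Whitespace and any other non-ACGT characters are ignored, so the
    space-separated blocks can be passed in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence, used only as a small example.
print(gc_percent("ATGCATCGTC CTTTTTTAAC"))
```

Passing the full gene sequence as a single string gives the value for the whole gene.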
 
Protein sequence
MHRPFLTITK SPSIATVTRY AYLNTLVSAL SKRLLSAEQL RNLVDQSASD VSVLLRTAGL 
TGISLQAMED RSLEQVLVDT LWAEAQRLIR PLSAEAQELA SYWLRRFEIG NLKIVLRGKL
TGRPKEAIQA DLIKINGIAA LPLNSLMEAE DAQEVLHCLE GTPYAGVARQ ARSVYGGEHY
DVPHVGGEGR ELFLIEATID REYYSGLARR VNAIQEERDR HYLRILIGYL IDQTNLVWLL
RYRFAYRLGP PLTYFLLCPG GYHLRSQHLL ALARGENFEE TLHNLPAPLA PLVAGAVSTS
EVEEMLGKHL LEVAQFILKR TTFNLGRAFA YLFLREKELL RIHGIIKGRT LQLAPHLIHQ
AMGLESAGFK EDESF
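
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds a codon table from the compact amino-acid string NCBI uses for the standard code. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the sketch assumes the gene begins with ATG, as this one does. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first; codons containing anything other than
    A, C, G or T are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCATCGTC CTTTTTTAAC TATCACCAAG"))  # MHRPFLTITK
```

The result for the first 30 bases matches the first ten residues of the protein sequence above.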