Gene Noc_2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2083 
Symbol 
ID3704943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2392739 
End bp2394514 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content58% 
IMG OID637738558 
ProductV-type ATP synthase subunit A 
Protein accessionYP_344073 
Protein GI77165548 
COG category[C] Energy production and conversion 
COG ID[COG1155] Archaeal/vacuolar-type H+-ATPase subunit A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.117647 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAC TGCTTGAAGT CAATGGTCCG CTGGTGAGGG CGCGTCTGCC CCAGGTACCC 
AATGGGGAGC AGGTGCGAAT TGGAACTCTC GGTTTGGTGG GCGAAGTCAT TGGCAGGGAA
GGCCAAGAGG CCCTGATTCA AGTTTATGAG GGGACTGAGT CGGTGCGCCC TGGGGAGGAA
GTCGAAGCGC TCGGCCATCC CCTCTCAGTG GAACTGGGAC CGGGCCTTCT GGGACAAGTT
TTTGATGGGA TACAACGTCC CCTTGGCCGT CTCCTGGAAG CCAGCGGGGA TCGGATTTCC
CGTGGCATCC AGATCCAGGG TTTGGAGCAA GCGCGGGTTT GGCGCTTTCA ACCCAACCCC
CAATTAGCTG CCGGCATGGC GGTTACCGGC GGCGTTTGTT TAGGCGCCGT GCCGGAGACC
CCTACTATTG AGCATCGTAT TCTGGTTCCT CCTGGATTGT CGGGTGAACT GTTGGAGCTG
GCCCCTGAAG GGGAGTATCG ATTGAGCGAT GTCATTGCCC GCCTGGATAT GGGTGATCAT
CGCTCCCAGG CGTTAACGCT TTCCCATCGC TGGCCAGTAC GTAACCCCCG TCCCTACCAG
CAGCGGGAGC ATGGGGTGTC GCCATTGATG ACCGGTCAGC GGATACTGGA TACTTTTTTC
CCCCTGCTCA AAGGCGGCAA AGCGGCTGTG CCTGGACCTT TTGGCGCCGG TAAAACCATG
GTCCAGCAAC AAATTGCCCG CTGGTCTAAT GCCGATATCG TCATTTATGT GGGTTGCGGT
GAGCGGGGCA ACGAGTTGGT TGAAGTCCTG GATTCCTTTC CCGAGCTAAC TGATCCCCAT
ACCGGCCGCT CCTTGATGGA GCGGACCTTG CTGGTTGCCA ATACCTCGAA TATGCCCGTG
GTTGCCCGGG AGGCATCCCT ATATGTCGGT GTAACCCTGG GGGAGTATTA CCGGGACCAG
GGCTACGATG TGGTGATTGT GGCCGATTCC ACCAGCCGCT GGGCCGAAGC CCTGCGGGAA
GTGGCGGGGC GTCTCGGACA GATGCCTGTG GAGGAAGGTT ATCCGGCTTA TTTAGCTTCC
CGGCTGGCCG CCTTCTATGA GCGTGCCGGG CGGGTTCAAA CCCTGGGGGG AAGCGTAGGC
TCGGTGACTC TCATTGGCGC GGTCTCCCCG CCAGGGGGGG ATTTTTCCGA ACCCGTAACT
AGTCATACCA AGGAAATCGT GCGGACCTTT TGGGCGCTCT CAAAAGACCT AGCGGACGCC
CGCCATTACC CGGCCGTGTC CTGGCGAGAA AGTTTTTCCG ATGATATTCC CGTGGCCGCC
CGCTGGTGGG CTGAACACAT TGATAAACAT TGGCAGGCAG GACGCGCCGA GGCCATGACT
TTGCTGACCC AGGCGGAAGA ACTGTCCCGG ATTGTCAATC TGGTAGGCCC CGAGGCTTTG
TCGGGAACTC AGCGCTGGAT TCTAGAAGGG GCAACGCTGA TTAAGGAAGG ACTGTTGCAA
CAAAGCGCCC TGGACCCAGT GGATAGTTTT TGCGCCCCCG AGAAGCAGTT TGTCCTCCTG
GACTTGATGC TCCAAATTTA TCATCAAGGC GTCGAATTAC TAGAGCAAGG CGTGCCGGTG
CAAGAGCTTT TGGGCCTTCC CGTGCTGGCC CGCGCTAGGC GCTGCAAGAG TGATTATAAA
AATACCCAAG TGGAAACACT TCAGGATTTT ACTAAGGAAA TAAAAGAAGC TTTCGGGCGG
CTTGGCAGGG AACATGCCGA GGCGGGAAAA ATCTAG
 
Protein sequence
MGKLLEVNGP LVRARLPQVP NGEQVRIGTL GLVGEVIGRE GQEALIQVYE GTESVRPGEE 
VEALGHPLSV ELGPGLLGQV FDGIQRPLGR LLEASGDRIS RGIQIQGLEQ ARVWRFQPNP
QLAAGMAVTG GVCLGAVPET PTIEHRILVP PGLSGELLEL APEGEYRLSD VIARLDMGDH
RSQALTLSHR WPVRNPRPYQ QREHGVSPLM TGQRILDTFF PLLKGGKAAV PGPFGAGKTM
VQQQIARWSN ADIVIYVGCG ERGNELVEVL DSFPELTDPH TGRSLMERTL LVANTSNMPV
VAREASLYVG VTLGEYYRDQ GYDVVIVADS TSRWAEALRE VAGRLGQMPV EEGYPAYLAS
RLAAFYERAG RVQTLGGSVG SVTLIGAVSP PGGDFSEPVT SHTKEIVRTF WALSKDLADA
RHYPAVSWRE SFSDDIPVAA RWWAEHIDKH WQAGRAEAMT LLTQAEELSR IVNLVGPEAL
SGTQRWILEG ATLIKEGLLQ QSALDPVDSF CAPEKQFVLL DLMLQIYHQG VELLEQGVPV
QELLGLPVLA RARRCKSDYK NTQVETLQDF TKEIKEAFGR LGREHAEAGK I