Gene Information |
Locus tag | Noc_0770 |
Symbol | |
ID | 3707036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 838653 |
End bp | 841526 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637737272 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_342813 |
Protein GI | 77164288 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
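The coordinate and length fields above are mutually consistent, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of this unspliced CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(838653, 841526)
print(length_bp)                  # 2874
print(protein_length(length_bp))  # 957
```

Both values match the Gene Length and Protein Length rows of the record.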
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.815806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCTA TGAAGCCACA AACTCCTGAA ACCAACCCCG CGCTCCAAGA TCAGACAGGG AATACCACTC CCTGGCGGGA TAAGGAACTG CGCGCCCGCG TCAAACTATT TGGCAATCTC CTAGGCCAGG TCATCCAAAA CCAGTCCGGG GAAAAAGTAT TCGCCGCTGT CGAGGCCTTG CGCAAGGGCT ATATTAACCT GCGCAAAAAA GAAAATTCTG ACAAGCGAAT CCAGTTGCTG CGCCTTATCG ACACGCTGAA TGTGGAGAAG ATCACCCAGG TCGTGCGGGC CTTTAGCATC TACTTTAGCC TCGCCAATAT TGCCGAAGAA GCTTACCAGC ACCGGCAACG GCAACGCCGC ATCGATGCAG GCGGACCTCT TTGGCGAGGC TCCTTCGAAG AGACTCTACG AGAACTCCGG AAGAGCGGAA TTAGTCCCGA GCAACTCCAG ATTATGCTGG ATAATCTAGC TTATATTCCC GTCATTACCG CCCATCCCAC CGAAGCCAAG CGCCGCACGG TGATGGAACA TCTCCGCAAA ATCTTCCTTG CCAGTAAACT TCTGGACGAG ACGCGCCTCA GTCAGCGGGA GGAAGAAACC CTCCACCGCC AATTAGAACG GCAAATCCAG GTCCTGTGGA AAACCGATGA AGTTCGCGCC CACCGGCCCC AGGTCCAGGA CGAAATCATC AACGGCCTAT TTTATTTCAA AGTTAGCTTG TTCCAGGCCG TACCCGAAAC CTATCGCCAA CTAGAAGAAG CCATCAACAA AGTCTATGGC GACATGCTGC CCGAGAGCAC CACTCTCCGG GTCCCCAGCT TTTTACACTT TGGCTCCTGG ATTGGCGGAG ACCGGGATGG CAACCCCAAT GTCACGCCGG AAGTCACCGC CATGGCCGTA CGGCTGCAAA TGCGGATGGC CCTACAGCAT TATATCGCCT GCATTACTAA ATTAACGCGC ATTCTCACCC ATTCCATTCC TCTCATCAGG CCCTCAACGG CCCTGACTGA AAGCATTAAC CAAGATTTGA GCGATTGCCC GGAGACCTTT CGAGGCGATC CCGACCGCTT TAGCCGCGAA CCTTACCGGC GCAAACTCTA CCTGATGCGC TACCGCCTGA TGGATAACTT ACGGGCCGTG GAACGATATC TCCTGCCAGA AATCCAGCCC ACCCCCCCTC AAGGCGTCGG TTATCCTTCC GAGGAAAAAT TTCTTGAGGA TCTCTGCCTC CTTCGCGACA GCCTGAGCAG CCATGGTGAC GGCAACATTG CCGCAGGGGA ATTGCAAGAT CTGATTCGCC TAGTAGAAAG TTTTGGCTTT TACCTCCTCA AACTGGATGT TCGCCAGGAA TCAGGCCGTC ATACGGAAGC AGTAGCGGAA TTAGTCAAAC ACCTCGACCT GCATCCCAGC TATCTCGATC TTTCCGAAAC TGAACGGCTA GGGCTCCTTT CCGAACAACT CGCCCGCGAA GAAGAAACCA CCATCCAGCG GGAGCGGCTT ACCCCCGCTA CCCGGGAAAC ACTGGATCTC TTTCACGTCA TGGCGCAAAT GCGCCAAGAG GTCAGCCCCC GAGTTTTCGG TCATTATGTG ATTTCCATGA CCCATGCCGC CAGTCATGTC ATGGAGGTCA TGTATCTAGG CTATCTTGCT GGCCTCGCTG GACGTCGGCG GGGTCAATGG CATTGTGGTC TGCAAATCTC TCCCCTGTTT GAAACCATCG AGGATTTAGA GCATATCGAG CCGGTCATGA CCGCCCTGCT TGATGATCCC AGCTATCGAG CCTTACTACA GGCCGCCGGC 
AACCAGCAAG AAGTCATGAT TGGCTACTCA GATTCCTGCA AGGACGGCGG TATCCTGGCT TCCTCCTGGA AACTGTATGA CGCCCAGAAA AAGGTAACCG GGCTCACCGA TAGCCGGGGG GTGGATTGTC GTATCTTCCA TGGCCGGGGC GGGACCATTG GCCGGGGCGG TGGCCCAACT TTTGACGCTA TCCTGTCCCA ACCCCAGGGG ACTGTCCACG GTCAAATCAA GTTCACGGAA CAGGGAGAAG TCCTCTCCTC CCGCTACAGT AACCCCGAGA CCGCGATTTA TGAACTCAAC ATGGGTATCA GTGGCCTGAT TAAGGCCAGC ACCTGCCTCG TCCAACCTCC CCAGGAAGAA AAGCGTGATT ATCTCGGTAT CATAGACTCT TTAGTGGAAA CAGGGGAGCA GACTTACCGG GAATTCACGG AACAAACGTC CGGCTTTCAA GATTATTTCT ATGAAGCCAC CCCGGTCAAT GAAATTGGTC TGTTGAACAT TGGCTCCCGC CCCCCCCACC GGAAAAAAGG AGATCGTTCC AAGAACTCAG TCCGGGCTAT CCCCTGGGTC TTTGGCTGGG CCCAGGCCCG GCATACTTTT CCCGCCTGGT ATGGTATCGG CAGCGCCTTG GAAAAATGGC GGGCTGGCGC GCCCGATCGG CTCGCAAAAC TCCAAACCAT GTATGAAGAG TGGCCCTATT TCCGTGCCCT GCTCAGCAAT ACCCAAATGT CCCTGGCCAA GGCCGAGTTG CATATCGCTC AGCAATATGC CGGCTTGTGC CTAGATCCAG AAACGGGACA GAAAATCTTT GCCCTGCTCA GCGCGGAGTA CCAACGCACG GTCACCCAGG TGCTCCATAT CGTGGGGGCC CACACCCTGC TGGAGGAGAA CCCTCCCCTG GCTCTGTCCT TGCAACGCCG GGACCCCTAC CTGGACCCCC TCAATCATAT CCAACTCACT CTTCTTAAAC GCACCCGCGA TCCACGAATC ACTCCCGAGG AGCGGGAAGC ATGGCTTGAT CCTCTGCTCC GTTCTATCAA TGCCATCGCG GCTGGGATGC GCAATACGGG CTGA
|
Protein sequence | MTSMKPQTPE TNPALQDQTG NTTPWRDKEL RARVKLFGNL LGQVIQNQSG EKVFAAVEAL RKGYINLRKK ENSDKRIQLL RLIDTLNVEK ITQVVRAFSI YFSLANIAEE AYQHRQRQRR IDAGGPLWRG SFEETLRELR KSGISPEQLQ IMLDNLAYIP VITAHPTEAK RRTVMEHLRK IFLASKLLDE TRLSQREEET LHRQLERQIQ VLWKTDEVRA HRPQVQDEII NGLFYFKVSL FQAVPETYRQ LEEAINKVYG DMLPESTTLR VPSFLHFGSW IGGDRDGNPN VTPEVTAMAV RLQMRMALQH YIACITKLTR ILTHSIPLIR PSTALTESIN QDLSDCPETF RGDPDRFSRE PYRRKLYLMR YRLMDNLRAV ERYLLPEIQP TPPQGVGYPS EEKFLEDLCL LRDSLSSHGD GNIAAGELQD LIRLVESFGF YLLKLDVRQE SGRHTEAVAE LVKHLDLHPS YLDLSETERL GLLSEQLARE EETTIQRERL TPATRETLDL FHVMAQMRQE VSPRVFGHYV ISMTHAASHV MEVMYLGYLA GLAGRRRGQW HCGLQISPLF ETIEDLEHIE PVMTALLDDP SYRALLQAAG NQQEVMIGYS DSCKDGGILA SSWKLYDAQK KVTGLTDSRG VDCRIFHGRG GTIGRGGGPT FDAILSQPQG TVHGQIKFTE QGEVLSSRYS NPETAIYELN MGISGLIKAS TCLVQPPQEE KRDYLGIIDS LVETGEQTYR EFTEQTSGFQ DYFYEATPVN EIGLLNIGSR PPHRKKGDRS KNSVRAIPWV FGWAQARHTF PAWYGIGSAL EKWRAGAPDR LAKLQTMYEE WPYFRALLSN TQMSLAKAEL HIAQQYAGLC LDPETGQKIF ALLSAEYQRT VTQVLHIVGA HTLLEENPPL ALSLQRRDPY LDPLNHIQLT LLKRTRDPRI TPEEREAWLD PLLRSINAIA AGMRNTG
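As a rough check that the gene and protein sequences correspond, the sketch below translates a stretch of the gene sequence and computes its GC fraction. It assumes the listed gene sequence is already the coding strand (it begins with ATG even though the gene lies on the minus strand) and that sense codons in translation table 11 decode as in the standard genetic code. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names introduced for illustration.

```python
# Standard codon table in NCBI order (bases T, C, A, G); translation
# table 11 uses the same amino acid assignments for sense codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_blocks = "ATGACATCTA TGAAGCCACA AACTCCTGAA"
print(translate(first_blocks))  # MTSMKPQTPE
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow comparison with the Protein Length and GC content rows.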