Gene Noc_0770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0770 
Symbol 
ID3707036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp838653 
End bp841526 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content55% 
IMG OID637737272 
Productphosphoenolpyruvate carboxylase 
Protein accessionYP_342813 
Protein GI77164288 
COG category[C] Energy production and conversion 
COG ID[COG2352] Phosphoenolpyruvate carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.815806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATCTA TGAAGCCACA AACTCCTGAA ACCAACCCCG CGCTCCAAGA TCAGACAGGG 
AATACCACTC CCTGGCGGGA TAAGGAACTG CGCGCCCGCG TCAAACTATT TGGCAATCTC
CTAGGCCAGG TCATCCAAAA CCAGTCCGGG GAAAAAGTAT TCGCCGCTGT CGAGGCCTTG
CGCAAGGGCT ATATTAACCT GCGCAAAAAA GAAAATTCTG ACAAGCGAAT CCAGTTGCTG
CGCCTTATCG ACACGCTGAA TGTGGAGAAG ATCACCCAGG TCGTGCGGGC CTTTAGCATC
TACTTTAGCC TCGCCAATAT TGCCGAAGAA GCTTACCAGC ACCGGCAACG GCAACGCCGC
ATCGATGCAG GCGGACCTCT TTGGCGAGGC TCCTTCGAAG AGACTCTACG AGAACTCCGG
AAGAGCGGAA TTAGTCCCGA GCAACTCCAG ATTATGCTGG ATAATCTAGC TTATATTCCC
GTCATTACCG CCCATCCCAC CGAAGCCAAG CGCCGCACGG TGATGGAACA TCTCCGCAAA
ATCTTCCTTG CCAGTAAACT TCTGGACGAG ACGCGCCTCA GTCAGCGGGA GGAAGAAACC
CTCCACCGCC AATTAGAACG GCAAATCCAG GTCCTGTGGA AAACCGATGA AGTTCGCGCC
CACCGGCCCC AGGTCCAGGA CGAAATCATC AACGGCCTAT TTTATTTCAA AGTTAGCTTG
TTCCAGGCCG TACCCGAAAC CTATCGCCAA CTAGAAGAAG CCATCAACAA AGTCTATGGC
GACATGCTGC CCGAGAGCAC CACTCTCCGG GTCCCCAGCT TTTTACACTT TGGCTCCTGG
ATTGGCGGAG ACCGGGATGG CAACCCCAAT GTCACGCCGG AAGTCACCGC CATGGCCGTA
CGGCTGCAAA TGCGGATGGC CCTACAGCAT TATATCGCCT GCATTACTAA ATTAACGCGC
ATTCTCACCC ATTCCATTCC TCTCATCAGG CCCTCAACGG CCCTGACTGA AAGCATTAAC
CAAGATTTGA GCGATTGCCC GGAGACCTTT CGAGGCGATC CCGACCGCTT TAGCCGCGAA
CCTTACCGGC GCAAACTCTA CCTGATGCGC TACCGCCTGA TGGATAACTT ACGGGCCGTG
GAACGATATC TCCTGCCAGA AATCCAGCCC ACCCCCCCTC AAGGCGTCGG TTATCCTTCC
GAGGAAAAAT TTCTTGAGGA TCTCTGCCTC CTTCGCGACA GCCTGAGCAG CCATGGTGAC
GGCAACATTG CCGCAGGGGA ATTGCAAGAT CTGATTCGCC TAGTAGAAAG TTTTGGCTTT
TACCTCCTCA AACTGGATGT TCGCCAGGAA TCAGGCCGTC ATACGGAAGC AGTAGCGGAA
TTAGTCAAAC ACCTCGACCT GCATCCCAGC TATCTCGATC TTTCCGAAAC TGAACGGCTA
GGGCTCCTTT CCGAACAACT CGCCCGCGAA GAAGAAACCA CCATCCAGCG GGAGCGGCTT
ACCCCCGCTA CCCGGGAAAC ACTGGATCTC TTTCACGTCA TGGCGCAAAT GCGCCAAGAG
GTCAGCCCCC GAGTTTTCGG TCATTATGTG ATTTCCATGA CCCATGCCGC CAGTCATGTC
ATGGAGGTCA TGTATCTAGG CTATCTTGCT GGCCTCGCTG GACGTCGGCG GGGTCAATGG
CATTGTGGTC TGCAAATCTC TCCCCTGTTT GAAACCATCG AGGATTTAGA GCATATCGAG
CCGGTCATGA CCGCCCTGCT TGATGATCCC AGCTATCGAG CCTTACTACA GGCCGCCGGC
AACCAGCAAG AAGTCATGAT TGGCTACTCA GATTCCTGCA AGGACGGCGG TATCCTGGCT
TCCTCCTGGA AACTGTATGA CGCCCAGAAA AAGGTAACCG GGCTCACCGA TAGCCGGGGG
GTGGATTGTC GTATCTTCCA TGGCCGGGGC GGGACCATTG GCCGGGGCGG TGGCCCAACT
TTTGACGCTA TCCTGTCCCA ACCCCAGGGG ACTGTCCACG GTCAAATCAA GTTCACGGAA
CAGGGAGAAG TCCTCTCCTC CCGCTACAGT AACCCCGAGA CCGCGATTTA TGAACTCAAC
ATGGGTATCA GTGGCCTGAT TAAGGCCAGC ACCTGCCTCG TCCAACCTCC CCAGGAAGAA
AAGCGTGATT ATCTCGGTAT CATAGACTCT TTAGTGGAAA CAGGGGAGCA GACTTACCGG
GAATTCACGG AACAAACGTC CGGCTTTCAA GATTATTTCT ATGAAGCCAC CCCGGTCAAT
GAAATTGGTC TGTTGAACAT TGGCTCCCGC CCCCCCCACC GGAAAAAAGG AGATCGTTCC
AAGAACTCAG TCCGGGCTAT CCCCTGGGTC TTTGGCTGGG CCCAGGCCCG GCATACTTTT
CCCGCCTGGT ATGGTATCGG CAGCGCCTTG GAAAAATGGC GGGCTGGCGC GCCCGATCGG
CTCGCAAAAC TCCAAACCAT GTATGAAGAG TGGCCCTATT TCCGTGCCCT GCTCAGCAAT
ACCCAAATGT CCCTGGCCAA GGCCGAGTTG CATATCGCTC AGCAATATGC CGGCTTGTGC
CTAGATCCAG AAACGGGACA GAAAATCTTT GCCCTGCTCA GCGCGGAGTA CCAACGCACG
GTCACCCAGG TGCTCCATAT CGTGGGGGCC CACACCCTGC TGGAGGAGAA CCCTCCCCTG
GCTCTGTCCT TGCAACGCCG GGACCCCTAC CTGGACCCCC TCAATCATAT CCAACTCACT
CTTCTTAAAC GCACCCGCGA TCCACGAATC ACTCCCGAGG AGCGGGAAGC ATGGCTTGAT
CCTCTGCTCC GTTCTATCAA TGCCATCGCG GCTGGGATGC GCAATACGGG CTGA
 
Protein sequence
MTSMKPQTPE TNPALQDQTG NTTPWRDKEL RARVKLFGNL LGQVIQNQSG EKVFAAVEAL 
RKGYINLRKK ENSDKRIQLL RLIDTLNVEK ITQVVRAFSI YFSLANIAEE AYQHRQRQRR
IDAGGPLWRG SFEETLRELR KSGISPEQLQ IMLDNLAYIP VITAHPTEAK RRTVMEHLRK
IFLASKLLDE TRLSQREEET LHRQLERQIQ VLWKTDEVRA HRPQVQDEII NGLFYFKVSL
FQAVPETYRQ LEEAINKVYG DMLPESTTLR VPSFLHFGSW IGGDRDGNPN VTPEVTAMAV
RLQMRMALQH YIACITKLTR ILTHSIPLIR PSTALTESIN QDLSDCPETF RGDPDRFSRE
PYRRKLYLMR YRLMDNLRAV ERYLLPEIQP TPPQGVGYPS EEKFLEDLCL LRDSLSSHGD
GNIAAGELQD LIRLVESFGF YLLKLDVRQE SGRHTEAVAE LVKHLDLHPS YLDLSETERL
GLLSEQLARE EETTIQRERL TPATRETLDL FHVMAQMRQE VSPRVFGHYV ISMTHAASHV
MEVMYLGYLA GLAGRRRGQW HCGLQISPLF ETIEDLEHIE PVMTALLDDP SYRALLQAAG
NQQEVMIGYS DSCKDGGILA SSWKLYDAQK KVTGLTDSRG VDCRIFHGRG GTIGRGGGPT
FDAILSQPQG TVHGQIKFTE QGEVLSSRYS NPETAIYELN MGISGLIKAS TCLVQPPQEE
KRDYLGIIDS LVETGEQTYR EFTEQTSGFQ DYFYEATPVN EIGLLNIGSR PPHRKKGDRS
KNSVRAIPWV FGWAQARHTF PAWYGIGSAL EKWRAGAPDR LAKLQTMYEE WPYFRALLSN
TQMSLAKAEL HIAQQYAGLC LDPETGQKIF ALLSAEYQRT VTQVLHIVGA HTLLEENPPL
ALSLQRRDPY LDPLNHIQLT LLKRTRDPRI TPEEREAWLD PLLRSINAIA AGMRNTG