Gene Noc_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2047 
Symbol 
ID3705023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2356196 
End bp2358151 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content59% 
IMG OID637738522 
Productacetoacetyl-CoA synthetase 
Protein accessionYP_344037 
Protein GI77165512 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR01217] acetoacetyl-CoA synthase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGAAC CCCTGTGGAG ACCGTCGAGC GCCCAGGTGC AGCAGGCCCA GATGACCTGT 
TTCCTGCACA CCGTCCAGGA CCAATACGGG GCTAAGGTCG CGGATTATCC CGCCCTTTAC
GCCTGGTCGC TCCAGCAACC CGAGGATTTT TGGGCGGCGG TGTGGCGTTT CTGTGAAGTG
AAGTCTTCCC AGCCTTGGGA GCGCGTGCTG GAAAACGGCG ACTCGATGCC GGGGGCGCGC
TGGTTTGTAG GTAGCCGGTT GAACTTCGCG GAGAATCTGC TGCGCTACCG AGATGAGCGA
CTGGCCCTGG TATTCCGGGG CGAAGAGGGT CGCCGATGCG CCCTTAGCTA CGGAGAGCTT
TACCTCCAAG TGGCGCGTTT AGCCCAAGCC TTGAAGGGAG CAGGTGTGGG TGTGGGGGAT
CGGGTTGTGG GGTGGTTGCC CAATGTGCCG GAGACCGTTA TGGCCATGCT GGCTACCACC
AGTCTGGGCG CCATTTGGTC TTCTTGTTCG CCGGATTTTG GCATTCAGGG GGTGCTGGAT
CGTTTTGGGC AAATCGGTCC TAAAGTGCTG TTGGCCGCTG ATGGCTATCA TTACAAAGGC
AAGGCTATTG ATTCCCTGGC CCGCCTAGCT GAAATCGTGG AGTCCTTGCC GGGCCTGCGG
CAGGTAGTTG TCGTCCCGTA CCTTCACAGC TCGCGGGATC TAGCCCCGAT CCCCAAGGTC
CAAACGTATG AAGCCTTTAT GGGGGAGGCG GAAAATTCGC CGCTCACCTT TGCCCAGCTA
CCCTTCGATC ATCCAATTTA TATTCTGTAC TCTTCGGGCA CCACCGGCGT CCCTAAGTGT
ATTGTCCATG GAGCTGGCGG GACCCTGCTC CAGCACCTCA AGGAACTGAG GCTCCATACG
GATCTAAAAC GGGAAGATCG GATCTTTTAT TACACCACCT GCGGCTGGAT GATGTGGAAC
TGGCTGGTGA GCGCCCTGGC AACGGGGGCT ACCGTGGTGC TCTATGACGG AGCGCCGCTT
TATCCCCGGC CTGAGAGTTT GTGGGATATG GCTGCCGAAG AGGGCATCAC CGTTTTTGGC
GCTAGCGCTA AATATCTTTC GGCCTTGGAA AAGGCGGGGG TCCAGCCGGC CCGCACCCAT
CATCTGTCAA AGCTCCGGAC CCTGCTTTCC ACCGGCTCCC CCCTGGCCCC CGAGTCCTTT
GACTATGTCT ACCGGGATAT CAAAGAGGAG GTGCAATTAT CCTCCATTTC CGGCGGCACC
GACATTGTGT CCTGTTTCGC CCTGGGCAAC CCCATCCTTC CCGTCTACCG GGGCGAACTG
CAATGCCGCG GCTTGGGAAT GAAGGTGGAG ATCTTTGATG AGAAAGGCCG TTCTGTCCAG
GGGCAGAAGG GGGAGCTGGT GTGCACCGCC CCGTTTCCCG CTATGCCGGT TTTTTTCTGG
AACGATCCAG GGGGCAAAAA ATACTGGGCC GCCTATTTCG AGCGATTCCC TGGAGTCTGG
GCCCATGGGG ATTATGCGGA GCTGACAGCC CATGGGGGCT TGATCATTCA TGGCCGCTCC
GACACGGTGC TCAATCCGGG GGGGGTGCGG ATTGGCACCG CCGAGATTTA CCGCCAGGTA
GAAAAACTGC CGGAAGTTCT CGAGAGTCTG GCCATCGGGC AAGCCTGGCA AGGGGATGTA
CGGATTATCC TGTTCGTGGT TCTCCGGGAG GGCCGGGTGC TGGACGAGGC GTTGATTAAC
CGCATTAGAC AGACGATTCG AAAACACGCC TCGCCCCATC ATAGGCCAGC CAAGGTGCTT
CAAGTCCCCG ATATGCCCCG CACCCTAAGC GGCAAACTGG TGGAGCTGGC GGTCAGCCAT
ACCATCCATG GCCGTCCGGT AAAAAATCTG GATGCCCTGG CCAATCCAGA AGCCTTGGAG
TATTTCCGGG ATCGGCCGGA ACTCCAAACC GATTAA
 
Protein sequence
MQEPLWRPSS AQVQQAQMTC FLHTVQDQYG AKVADYPALY AWSLQQPEDF WAAVWRFCEV 
KSSQPWERVL ENGDSMPGAR WFVGSRLNFA ENLLRYRDER LALVFRGEEG RRCALSYGEL
YLQVARLAQA LKGAGVGVGD RVVGWLPNVP ETVMAMLATT SLGAIWSSCS PDFGIQGVLD
RFGQIGPKVL LAADGYHYKG KAIDSLARLA EIVESLPGLR QVVVVPYLHS SRDLAPIPKV
QTYEAFMGEA ENSPLTFAQL PFDHPIYILY SSGTTGVPKC IVHGAGGTLL QHLKELRLHT
DLKREDRIFY YTTCGWMMWN WLVSALATGA TVVLYDGAPL YPRPESLWDM AAEEGITVFG
ASAKYLSALE KAGVQPARTH HLSKLRTLLS TGSPLAPESF DYVYRDIKEE VQLSSISGGT
DIVSCFALGN PILPVYRGEL QCRGLGMKVE IFDEKGRSVQ GQKGELVCTA PFPAMPVFFW
NDPGGKKYWA AYFERFPGVW AHGDYAELTA HGGLIIHGRS DTVLNPGGVR IGTAEIYRQV
EKLPEVLESL AIGQAWQGDV RIILFVVLRE GRVLDEALIN RIRQTIRKHA SPHHRPAKVL
QVPDMPRTLS GKLVELAVSH TIHGRPVKNL DALANPEALE YFRDRPELQT D