Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1051 |
Symbol | |
ID | 3707234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1158218 |
End bp | 1159558 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637737556 |
Product | Acetyl-CoA carboxylase, biotin carboxylase |
Protein accession | YP_343089 |
Protein GI | 77164564 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase |
TIGRFAM ID | [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00003463 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGATA AGATTGTTAT CGCCAATCGG GGCGAAATCG CCTTGCGTAT CCTCCGGGCT TGCTGGGAAT TAGGGCTCAA AACCGTAGCC ATCCACTCTG AAGTGGATCG CGAACTCAAA CATGTTCTAC TAGCGGATGA GACGGTTTGT ATCGGCCCTG CGGCATCTTC TCAAAGCTAC TTGAATATTC CCGCCGTGAT CAGCGCCGCT GAGATTACCG ATGCCGTCGC CATTCACCCA GGTTACGGTT TTTTGTCCGA AAACGCCGAC TTTGCTGAAC GCGTTGAACA AAGCGGCTTC GTCTTTATCG GGCCGCGGCC TGAGACTATC CGTCTGATAG GCGATAAAGT CTCCGCTATT AAGGCCATGA AGTCCTCCGG CGTGCCATGC GTACCCGGCT CCGAAGGGCC CCTCGGAGAA GATGATGAGG AAAATATAGC CATTGCCAAG GAAATCGGCT ATCCGGTCAT GATTAAGGCT TCAGGGGGAG GGGGAGGCCG AGGAATGCGC GTTGTTCATT CTGAGGCGCA TTTGCCCACC GCTATTTCCC TCACCCGGAG CGAAGCCAGC GCCGCCTTTG GCAATGACAT GGTTTACATG GAAAAATATC TGGAAAATCC TCGTCATGTG GAATTCCAAG TTCTGGCCGA CACCCACGGT CAAGCCATCT ACCTCGGCGA GCGGGACTGT TCCATGCAGC GCCGTCACCA GAAAGTTGTT GAAGAGGCAC CTGCCCCAGG CATTACCAAT GAACAACGGC AACGCATGGG AGAAATCTGC ACTGAAGCCT GCCGCAAGAT GGGTTACCGA GGAGCAGGTA CGTTTGAATT TCTCTATCAA GATGGCGAAT TTTATTTCAT TGAAATGAAT ACCCGAGTCC AGGTGGAACA CCCTGTAACT GAAATGATCA CCGGGATAGA CATTGTCAAG GAGCAACTCC GTATTGCTGC CGGAGAGAAG CTCAGTTATC GCCAGGAAGA TATCATGATC CACGGGCACG CCATCGAGTG CCGCATCAAC GCCGAGGATC CCACTAATTT CATGCCCAGC CCAGGAACGG TGACAAGATA TCATACGCCT GGTGGCCCGG GCGTCCGAAT AGATTCCCAC CTATACGCTG GTTATACTGT TCCCCCTCAC TACGATTCTT TGATCGGCAA ACTCATTACC CATGGGGAAA CCCGGGAAGC AGCCATTGCG CGCATGCAAA TTGCACTCAC TGAACTGGTC ATCGATGGCA TTAAGTGTAA TGCGCCACTC CATCAAAAAA TCCTCGACAA CACGCACTTC CGGGCTGGCG GCGCTAATAT CCACTACCTA GAGCGAATGC TAGGATTATA G
|
Protein sequence | MLDKIVIANR GEIALRILRA CWELGLKTVA IHSEVDRELK HVLLADETVC IGPAASSQSY LNIPAVISAA EITDAVAIHP GYGFLSENAD FAERVEQSGF VFIGPRPETI RLIGDKVSAI KAMKSSGVPC VPGSEGPLGE DDEENIAIAK EIGYPVMIKA SGGGGGRGMR VVHSEAHLPT AISLTRSEAS AAFGNDMVYM EKYLENPRHV EFQVLADTHG QAIYLGERDC SMQRRHQKVV EEAPAPGITN EQRQRMGEIC TEACRKMGYR GAGTFEFLYQ DGEFYFIEMN TRVQVEHPVT EMITGIDIVK EQLRIAAGEK LSYRQEDIMI HGHAIECRIN AEDPTNFMPS PGTVTRYHTP GGPGVRIDSH LYAGYTVPPH YDSLIGKLIT HGETREAAIA RMQIALTELV IDGIKCNAPL HQKILDNTHF RAGGANIHYL ERMLGL
|
| |