Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2001 |
Symbol | |
ID | 3704886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2306097 |
End bp | 2307950 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637738478 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_343993 |
Protein GI | 77165468 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.181202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCTT ATCGATCCCG AACGACGACT CACGGTCGTA ATATGGCTGG TGCCCGGGCC TTGTGGCGGG CCACCGGTAT GAAAGAGGGC GATTTTGGTA AGCCCATTAT TGCCATTGCC AATTCTTTTA CTCAGTTCGT CCCCGGTCAT GTCCACCTCA AGGATTTAGG ACAGCTAGTC GCCCGGGAGA TTGAAAAAGC CGGGGGGGTG GCCAAGGAAT TTCATACTAT TGCCGTGGAT GACGGAATTG CCATGGGGCA TAGTGGCATG CTGTATTCTC TGCCTTCGAG GGAAATCATT GCCGATTCAG TAGAATACAT GGTCAATGCC CACTGCGCCG ATGCTTTGGT GTGCATTTCT AATTGTGACA AGATTACTCC GGGTATGCTG ATGGCTGCCA TGCGCTTAAA TATTCCAGCA GTATTTATCT CGGGTGGGCC CATGGAAGCG GGCAAGGTTA AAATTCGGGG TAAGAGTGTA AGCTTAGATT TGGTAGATGC CATAGTGGCG GCGGTTGACC CTGCTGAAAG CGATGCCGAT GTAATGGCCT ATGAGCGCTC AGCTTGTCCT ACCTGCGGTT CTTGCTCCGG GATGTTCACT GCTAACTCGA TGAATTGCCT GACCGAAGCC TTGGGGTTAG CGTTGCCAGG CAATGGTTCC TTGCTGGCGA CTCATGCTGA CAGGAAAGAA TTGTTCCTAG AAGCAGGACG CTTGATTGTG GCGTTGGCAA AACGTTATTA CGAGCAGGAT GATGAAACTG TTTTGCCGCG CTCAATTGCT AATTTTGGGG CTTTTGAGAA TGCCATGAGT CTGGATATCG CTATGGGCGG TTCGACTAAT ACGGTGCTTC ACTTGCTGGC CGCTGCTCAG GAAGGGGGAG TGGATTTCAC GATGGCGGAT ATTGATCGCT TGTCCCGTAA GGTGCCCAAT TTGTGTAAAG TGGCTCCTGC AACGCCAGAA TATCACATGG AGGATGTTCA CCGGGCGGGG GGTGTCATTA GTATTTTGGG GGAATTAGAT CGGGCAGGAT TGATCCATCG CCAAATGGCA ACCGTTCATA GCCCGACCTT GGGCGCGGCG CTTGACCAAT GGGATATCGT CCGTTCCAGC TATGAGGCTG CCCAGAGTCG CTATCTTGCC GCTCCGGGGG GTGTCCCTAC CCAAGTGGCT TTTAGCCAAG GGAATCGCTG GGAAAGTCTG GATTTGGATA GGGCGCAAGG TTGTATTCGC GATATTGCCC ATGCTTACAG CAAGGATGGG GGGTTGGCGG TGCTTTATGG TAACCTCGCA AAGGATGGTT GTATTGTCAA GACTGCTGGA GTAGACCCGT CGATATTGAT TTTTTCCGGG CCGGCCCGGC TATTTGAGAG TCAGGAGGCG GCGATAGCAG CTATTCTGGG AGATAAAATT CAGCCGGGTG ACGTTGTGCT TATTCGTTAT GAAGGCCCCA AGGGGGGACC TGGAATGCAG GAGATGCTCT ATCCCACCAG TTATCTGAAA TCTAAAGGAT TAGGCGAAGT CTGTGCGCTC ATCACGGATG GCCGCTTCTC TGGAGGGACT TCGGGACTTT CTATTGGCCA CGTTTCTCCT GAAGCGGCTG AAGGTGGCAC CATCGGTTTG GTGGAGGAAG GTGACAGAAT TGAAATCGAC ATTCCTCATC GGCGTATTCA TCTCGCAGTG GACGAGGAGG AATTAGCACA GCGCCAAAGA GCCATGGAGG CAAAGGCGCA GCAGGCTTGG CGGCCAGTTA ATCGCAATCG TACTGTATCC CTGGCGCTGC AGGCCTACGC GGCGCTCACC ACCTCAGCAG CGAAAGGTGC GGTTCGGGAC TTGGGGCAAC TTAATAGATC ATAG
|
Protein sequence | MPAYRSRTTT HGRNMAGARA LWRATGMKEG DFGKPIIAIA NSFTQFVPGH VHLKDLGQLV AREIEKAGGV AKEFHTIAVD DGIAMGHSGM LYSLPSREII ADSVEYMVNA HCADALVCIS NCDKITPGML MAAMRLNIPA VFISGGPMEA GKVKIRGKSV SLDLVDAIVA AVDPAESDAD VMAYERSACP TCGSCSGMFT ANSMNCLTEA LGLALPGNGS LLATHADRKE LFLEAGRLIV ALAKRYYEQD DETVLPRSIA NFGAFENAMS LDIAMGGSTN TVLHLLAAAQ EGGVDFTMAD IDRLSRKVPN LCKVAPATPE YHMEDVHRAG GVISILGELD RAGLIHRQMA TVHSPTLGAA LDQWDIVRSS YEAAQSRYLA APGGVPTQVA FSQGNRWESL DLDRAQGCIR DIAHAYSKDG GLAVLYGNLA KDGCIVKTAG VDPSILIFSG PARLFESQEA AIAAILGDKI QPGDVVLIRY EGPKGGPGMQ EMLYPTSYLK SKGLGEVCAL ITDGRFSGGT SGLSIGHVSP EAAEGGTIGL VEEGDRIEID IPHRRIHLAV DEEELAQRQR AMEAKAQQAW RPVNRNRTVS LALQAYAALT TSAAKGAVRD LGQLNRS
|
| |