Gene Noc_2001 details

Gene Information

Locus tag: Noc_2001
Symbol:
ID: 3704886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2306097
End bp: 2307950
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 53%
IMG OID: 637738478
Product: dihydroxy-acid dehydratase
Protein accession: YP_343993
Protein GI: 77165468
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
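
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the unspliced CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2306097, 2307950)
print(length)                  # 1854
print(protein_length(length))  # 617
```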


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.181202
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCCCGCTT ATCGATCCCG AACGACGACT CACGGTCGTA ATATGGCTGG TGCCCGGGCC 
TTGTGGCGGG CCACCGGTAT GAAAGAGGGC GATTTTGGTA AGCCCATTAT TGCCATTGCC
AATTCTTTTA CTCAGTTCGT CCCCGGTCAT GTCCACCTCA AGGATTTAGG ACAGCTAGTC
GCCCGGGAGA TTGAAAAAGC CGGGGGGGTG GCCAAGGAAT TTCATACTAT TGCCGTGGAT
GACGGAATTG CCATGGGGCA TAGTGGCATG CTGTATTCTC TGCCTTCGAG GGAAATCATT
GCCGATTCAG TAGAATACAT GGTCAATGCC CACTGCGCCG ATGCTTTGGT GTGCATTTCT
AATTGTGACA AGATTACTCC GGGTATGCTG ATGGCTGCCA TGCGCTTAAA TATTCCAGCA
GTATTTATCT CGGGTGGGCC CATGGAAGCG GGCAAGGTTA AAATTCGGGG TAAGAGTGTA
AGCTTAGATT TGGTAGATGC CATAGTGGCG GCGGTTGACC CTGCTGAAAG CGATGCCGAT
GTAATGGCCT ATGAGCGCTC AGCTTGTCCT ACCTGCGGTT CTTGCTCCGG GATGTTCACT
GCTAACTCGA TGAATTGCCT GACCGAAGCC TTGGGGTTAG CGTTGCCAGG CAATGGTTCC
TTGCTGGCGA CTCATGCTGA CAGGAAAGAA TTGTTCCTAG AAGCAGGACG CTTGATTGTG
GCGTTGGCAA AACGTTATTA CGAGCAGGAT GATGAAACTG TTTTGCCGCG CTCAATTGCT
AATTTTGGGG CTTTTGAGAA TGCCATGAGT CTGGATATCG CTATGGGCGG TTCGACTAAT
ACGGTGCTTC ACTTGCTGGC CGCTGCTCAG GAAGGGGGAG TGGATTTCAC GATGGCGGAT
ATTGATCGCT TGTCCCGTAA GGTGCCCAAT TTGTGTAAAG TGGCTCCTGC AACGCCAGAA
TATCACATGG AGGATGTTCA CCGGGCGGGG GGTGTCATTA GTATTTTGGG GGAATTAGAT
CGGGCAGGAT TGATCCATCG CCAAATGGCA ACCGTTCATA GCCCGACCTT GGGCGCGGCG
CTTGACCAAT GGGATATCGT CCGTTCCAGC TATGAGGCTG CCCAGAGTCG CTATCTTGCC
GCTCCGGGGG GTGTCCCTAC CCAAGTGGCT TTTAGCCAAG GGAATCGCTG GGAAAGTCTG
GATTTGGATA GGGCGCAAGG TTGTATTCGC GATATTGCCC ATGCTTACAG CAAGGATGGG
GGGTTGGCGG TGCTTTATGG TAACCTCGCA AAGGATGGTT GTATTGTCAA GACTGCTGGA
GTAGACCCGT CGATATTGAT TTTTTCCGGG CCGGCCCGGC TATTTGAGAG TCAGGAGGCG
GCGATAGCAG CTATTCTGGG AGATAAAATT CAGCCGGGTG ACGTTGTGCT TATTCGTTAT
GAAGGCCCCA AGGGGGGACC TGGAATGCAG GAGATGCTCT ATCCCACCAG TTATCTGAAA
TCTAAAGGAT TAGGCGAAGT CTGTGCGCTC ATCACGGATG GCCGCTTCTC TGGAGGGACT
TCGGGACTTT CTATTGGCCA CGTTTCTCCT GAAGCGGCTG AAGGTGGCAC CATCGGTTTG
GTGGAGGAAG GTGACAGAAT TGAAATCGAC ATTCCTCATC GGCGTATTCA TCTCGCAGTG
GACGAGGAGG AATTAGCACA GCGCCAAAGA GCCATGGAGG CAAAGGCGCA GCAGGCTTGG
CGGCCAGTTA ATCGCAATCG TACTGTATCC CTGGCGCTGC AGGCCTACGC GGCGCTCACC
ACCTCAGCAG CGAAAGGTGC GGTTCGGGAC TTGGGGCAAC TTAATAGATC ATAG
 
Protein sequence
MPAYRSRTTT HGRNMAGARA LWRATGMKEG DFGKPIIAIA NSFTQFVPGH VHLKDLGQLV 
AREIEKAGGV AKEFHTIAVD DGIAMGHSGM LYSLPSREII ADSVEYMVNA HCADALVCIS
NCDKITPGML MAAMRLNIPA VFISGGPMEA GKVKIRGKSV SLDLVDAIVA AVDPAESDAD
VMAYERSACP TCGSCSGMFT ANSMNCLTEA LGLALPGNGS LLATHADRKE LFLEAGRLIV
ALAKRYYEQD DETVLPRSIA NFGAFENAMS LDIAMGGSTN TVLHLLAAAQ EGGVDFTMAD
IDRLSRKVPN LCKVAPATPE YHMEDVHRAG GVISILGELD RAGLIHRQMA TVHSPTLGAA
LDQWDIVRSS YEAAQSRYLA APGGVPTQVA FSQGNRWESL DLDRAQGCIR DIAHAYSKDG
GLAVLYGNLA KDGCIVKTAG VDPSILIFSG PARLFESQEA AIAAILGDKI QPGDVVLIRY
EGPKGGPGMQ EMLYPTSYLK SKGLGEVCAL ITDGRFSGGT SGLSIGHVSP EAAEGGTIGL
VEEGDRIEID IPHRRIHLAV DEEELAQRQR AMEAKAQQAW RPVNRNRTVS LALQAYAALT
TSAAKGAVRD LGQLNRS
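
Both sequences are printed in blocks of 10 characters, so the spaces and line breaks need to be stripped before any analysis. The following is a minimal sketch of that cleanup, a GC-fraction calculation, and a translation of the coding sequence. The function names (`clean`, `gc_content`, `translate`) are hypothetical. The codon mapping is the standard one, which translation table 11 shares for all internal codons (table 11 differs in the set of permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(raw: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = clean(
    "ATGCCCGCTT ATCGATCCCG AACGACGACT CACGGTCGTA ATATGGCTGG TGCCCGGGCC"
)
print(translate(first_line))  # MPAYRSRTTTHGRNMAGARA
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_content` gives a value that can be compared with the GC content field in the gene information.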