Gene Noc_1491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1491 
Symbol 
ID3705982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1650427 
End bp1651527 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content53% 
IMG OID637737978 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_343507 
Protein GI77164982 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.139847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTTC CTACTGAAGA TCTTCGCATT AAGAACATTC AAGAAGTTAT TCCCCCGGCC 
CAACTCCATG AAGGCTTGCC CATTACCAGC GAGGCATCAA AAACAGTCTA TCGAACCCGG
CAAGCCATTC AGGAAGTGCT CGGCGGGAAA GATGACCGCC TGCTGGTGGT CGCTGGCCCC
TGCTCCATCC ATGATCCCCA GGCTGCACGG GATTATGGGA AAAGACTCAA GCTACTAATT
GATGAGCTTG CCGATGAATT GCTCATCGTC ATGCGCGTTT ATTTTGAGAA ACCTCGCTCT
ACAGTAGGCT GGAAAGGGCT TATTAACGAT CCCCATTTGG ACGGCAGCTT TCAGATTAAT
GAAGGCTTGC GCTTGGCCCG CAGGCTGCTG TTGGATCTGG CCGAAACAGG TGTGCCGGCA
GGCACCGAAT ACCTGGATCT CGTTAGCCCA CAATATATCG CTGATCTAAT TGCCTGGGGC
GCAATCGGGG CTCGCACAAC TGAAAGTCAA GTGCATCGGG AACTCGCCTC GGGACTCTCA
TGTCCGGTTG GCTTTAAGAA TGCGACCAAC GGCAGCTTAG GTGCCGCCAT GTCCGCTATC
GTCTCGGCCT CAAAGCCCCA TCATTTTCTC TCCCTCACCT TGGCTGGCCG TTCGGCTATT
TTTTCAACTG CCGGAAATCC AGATTGCCAT CTCATTTTAC GGGGTGGGCA GAAACCTAAT
TACGATGCAG CAAGCGTCAA CGAGACGGCC CATAATCTTA TTCAAACGGG CCTTCGGCCT
CAAGTCATGA TTGATTGCAG CCATGGCAAC AGCAGCAAGA ATCCCAAGAA GCAAGTTCTG
GTTGCCAGGG ATATTGCGGG ACAAATTGCT GCGGGCGATA GGCGAATCAT GGGGGTTATG
CTGGAAAGCC ATCTCGTAGC AGGACGCCAG GACGTCATCC CAAACACCCC TCTTACCTAT
GGCCAAAGCA TCACCGATGC CTGTATAGGC TGGGAGGAAA GCGAGCAGTT ACTTCGTGAA
TTTGCCCGCG CCATACAGAA GCGGCGGCAA ATGCCAGAAA AACACATTGA AGCTAAACAT
GGATGTTCAG CAACCCCCTA G
 
Protein sequence
MSFPTEDLRI KNIQEVIPPA QLHEGLPITS EASKTVYRTR QAIQEVLGGK DDRLLVVAGP 
CSIHDPQAAR DYGKRLKLLI DELADELLIV MRVYFEKPRS TVGWKGLIND PHLDGSFQIN
EGLRLARRLL LDLAETGVPA GTEYLDLVSP QYIADLIAWG AIGARTTESQ VHRELASGLS
CPVGFKNATN GSLGAAMSAI VSASKPHHFL SLTLAGRSAI FSTAGNPDCH LILRGGQKPN
YDAASVNETA HNLIQTGLRP QVMIDCSHGN SSKNPKKQVL VARDIAGQIA AGDRRIMGVM
LESHLVAGRQ DVIPNTPLTY GQSITDACIG WEESEQLLRE FARAIQKRRQ MPEKHIEAKH
GCSATP