Gene Noc_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1014 
Symbol 
ID3707275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1122030 
End bp1123118 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content57% 
IMG OID637737519 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_343052 
Protein GI77164527 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA AAATTGCCCT TTTGGCGGGC GATGGGATCG GTCCCGAGAT TATTGCCGAG 
GCGGTTAAGG TACTGGCGGC GTTGGGGGAG CATGGGGAAC TGGATGTGGT GCTAGAGTCA
GGGCTGATTG GGGGTGCTGC CTATGAGGTC ACGGGCACTC CGCTGCCTGC GGAAACTCTT
GCTCTAGCCC AGGATTCGGA TGCGATCCTG CTAGGCGCCG TCGGGGGTCC CCAGTGGGAA
ACCCTGGATA TTAAAGTCCG CCCTGAAAAA GGATTACTTG GCATTCGTTC CGCGCTAGGG
CTATTTGCTA ATCTGCGTCC TGCTATTCTT TACCCTCAAT TGGTAGGAGC CTCTACGCTC
AAACCGGAGG TGGTGTCAGG GCTCGATCTC ATGATAGTGC GGGAGCTGAC CGGCGGTATT
TATTTTGGCC AGCCCCGGGG GGTGTGCCGC CTTGAGCATG GCGAGCGGCA AGGGTTTAAT
ACCCTGGTTT ATAAGGAATC GGAAATCCGG CGCATTGCCC AGGTGGCCTT TACCATTGCC
ATGAAACGGG ATCGCCGGGT TTGCTCCGTG GACAAAGCCA ATGTGCTGGA AGCGACCGAG
CTATGGCGCG AGGTAATGTC TGAGGTAGCC GCAGAGTTTC CCGAAGTGGA GCTTAGCCAC
ATGTATGTAG ACAATGCTGC CATGCAGCTT GTTCGAGCCC CCAAGCAATT TGATGTGGTG
GTAACTACGA ATATGTTTGG GGATATTTTG TCCGATTGCG CCGCAATGCT GACCGGCTCT
ATCGGGATGC TGCCTTCCGC CTCCCTGGAT CAAAGCGGGA GGGGAATGTA CGAGCCCATC
CACGGTTCGG CGCCGGATAT TGCTGGCCAA GGGATAGCTA ATCCCCTGGC GACCATCCTC
TCCGTAGCGA TGATGTTGCG TTACTCTCTC GATTCGCCCG CCCAGGCCAC GCGCCTTGAA
AAGGCTGTAG GTGCCGTACT TGACCAAGGA TTGCGGACTC CCGATATTTA TTCTCCCGGC
ACGACCCAGG TCAGCACTGG GGCCATGGGG GATGCGGTCG TGGAGGCATT TATCCACACC
CCCGGTTGA
 
Protein sequence
MSKKIALLAG DGIGPEIIAE AVKVLAALGE HGELDVVLES GLIGGAAYEV TGTPLPAETL 
ALAQDSDAIL LGAVGGPQWE TLDIKVRPEK GLLGIRSALG LFANLRPAIL YPQLVGASTL
KPEVVSGLDL MIVRELTGGI YFGQPRGVCR LEHGERQGFN TLVYKESEIR RIAQVAFTIA
MKRDRRVCSV DKANVLEATE LWREVMSEVA AEFPEVELSH MYVDNAAMQL VRAPKQFDVV
VTTNMFGDIL SDCAAMLTGS IGMLPSASLD QSGRGMYEPI HGSAPDIAGQ GIANPLATIL
SVAMMLRYSL DSPAQATRLE KAVGAVLDQG LRTPDIYSPG TTQVSTGAMG DAVVEAFIHT
PG