Gene Nmul_A0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0472 
Symbol 
ID3784889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp527769 
End bp528785 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content57% 
IMG OID637810548 
Productketol-acid reductoisomerase 
Protein accessionYP_411172 
Protein GI82701606 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0059] Ketol-acid reductoisomerase 
TIGRFAM ID[TIGR00465] ketol-acid reductoisomerase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGTTT ATTACGATAA AGACGCCGAC TTGTCGCTCA TTCGGGACAA GAAAGTCACC 
ATTGTCGGCT ACGGGTCGCA AGGTCACGCC CACGCCAACA ATCTGAGCGA TTCCGGCGTG
GCGGTAACAG TTGGCCTGCG CAAGGAAGGT GCTTCCTGGG GCAAGGCGGA AAAGGCCGGG
CTTACCGTCA AAGAGGTGGC TGAATCGGTA AAGGATGCAG ACGTCGTGAT GGTCCTGCTG
CCCGATGAGC AGATTGCTGA TGTATATGCG ACTGAAATCG AACCCAACCT CAAGAAAGGT
GCTACTCTTG CCTTTGCCCA TGGCTTCAAT ATTCATTATG GCCAAGTAGC GCCCAGGGAA
GATCTGGACG TCATCATGAT CGCTCCCAAG GGGCCGGGAC ACCTGGTACG CTCCACCTAC
CTCCAGGGCG GGGGTGTGCC TTCACTTATT GCAGTGCACC AGGACAAGTC CGGCAGGGCA
CGTGACCTGG CGCTCTCCTA TGCGGCTGCC AACGGCGGCA CCCGTGGCGG AGTGATCGAA
ACCAATTTCC GCGAGGAAAC CGAAACCGAT CTTTTCGGCG AACAGGTCGT GCTGTGCGGT
GGTCTGACCG CCTTGATTCA GGCCGGCTTT GAAACCCTGG TGGAAGCCGG CTACGCCCCG
GAGATGGCCT ATTTCGAATG TCTGCACGAA GTCAAGCTGA TCGTCGACCT GATCTATGAA
GGCGGCATCG CCAACATGCG CTACTCCATT TCCAACAACG CCGAGTATGG GGATATTTCG
CGCGGTCCCC GTGTGATCAC CGACGCCACC CGTGCCGAAA TGCGCAAGAT TCTCCGCCAG
ATTCAGACAG GGGAATATGC CCGCGAATTC ATCCTTGAAA ATCGCGCCGG CGCACCCATG
CTCAAAGCCA GCCGCCGTCT CGCATCCGAG CACCAGATCG AACAGGTGGG CGCCAAACTG
CGCGATATGA TGCCCTGGAT CAAAAAGAAC AAGCTGGTCG ATCAGGCGAA AAATTAG
 
Protein sequence
MNVYYDKDAD LSLIRDKKVT IVGYGSQGHA HANNLSDSGV AVTVGLRKEG ASWGKAEKAG 
LTVKEVAESV KDADVVMVLL PDEQIADVYA TEIEPNLKKG ATLAFAHGFN IHYGQVAPRE
DLDVIMIAPK GPGHLVRSTY LQGGGVPSLI AVHQDKSGRA RDLALSYAAA NGGTRGGVIE
TNFREETETD LFGEQVVLCG GLTALIQAGF ETLVEAGYAP EMAYFECLHE VKLIVDLIYE
GGIANMRYSI SNNAEYGDIS RGPRVITDAT RAEMRKILRQ IQTGEYAREF ILENRAGAPM
LKASRRLASE HQIEQVGAKL RDMMPWIKKN KLVDQAKN