Gene Nmul_A1292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1292 
Symbol 
ID3784329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1485371 
End bp1486390 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content58% 
IMG OID637811379 
ProductShort-chain dehydrogenase/reductase SDR 
Protein accessionYP_411987 
Protein GI82702421 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAG AAGACAACGC CAATGCCGCA CGGCGCCAGT TTGTGGGCGG CATGACAACC 
GGCCTTGCCG CAGCATTTGT CTACCCGGCC TTCGCTCAGC AAGGGCAGCA AGCAAACCCT
TCCGGCTCCC TGCAAGGGCC CGGCCAGTCC CGTAAGCAGG ATCCGAGAAC GCAATATCCT
ATTCCCCCCT TTCCCCAGCA AAAACAGGAG CCTCCCGGCC TTGTAAGCAA GATGATGCCG
CGGCCTGATC ACGGCGAGAC AACCTACAAG GGTTCGGGGC GGCTAGTGGA TAGAAAGGCG
CTTGTGACCG GTGGAGATTC GGGCATTGGA CGCGCCGCCG CCATCGCCTT TGCGCGTGAG
GGAGCCGATG TCGCGATTAA TTATCTCCCG GTCGAGGAGT CCGATGCTCG TGAGGTCGTG
GAAATCATCC GGGCAGAAGG GCGAAAGGCG GTCGCGATTC CTGGCGATAT CAGGGATGAG
AATTTTTGCT CCAGGCTTGT CGCCAACGCC GTCCGGGAGC TGGGCGGACT GGATATCCTT
GTCAACAATG CGGCCATGGC TGTCGCACAG CCCTCTATCG TTGATCTCAC GACAGAACAG
TTCGATTCGA TCTTCAAATG CAACGTCTAT GCCATGTTCT GGATCACCAA GGCGGCCATG
CCGCACCTTA AGCCCGGAGC GGCCATCATC AATACAAGTT CTGTTGAGGC TTACACTCCA
TCCGATGCGT TTCTCGACTA CGCCCAGACA AAGGCGTGCA ATGTTGCTTT CACGAAATCA
CTGGCGAAGC AGTTGGCCAA CAAGGGTATC CGGGTGAATG CGGTGGCGCC GGGACCATTC
TGGACACCGT TGCAGACGGC TGGATGGGCG GATCTCAGCA GGTTGGGCAA GGAGACTCCG
CTTGGCAGAC CCGGTCAACC AGCGGAACTG GGTCCCCTGT ATGTTTTCCT TGCATCACAG
GAATCAAGCT ATGCAACCGG ACAGGTGTAC GGCGCTTCAG GGGGGGAAGG GCAGCCCTAA
 
Protein sequence
MKPEDNANAA RRQFVGGMTT GLAAAFVYPA FAQQGQQANP SGSLQGPGQS RKQDPRTQYP 
IPPFPQQKQE PPGLVSKMMP RPDHGETTYK GSGRLVDRKA LVTGGDSGIG RAAAIAFARE
GADVAINYLP VEESDAREVV EIIRAEGRKA VAIPGDIRDE NFCSRLVANA VRELGGLDIL
VNNAAMAVAQ PSIVDLTTEQ FDSIFKCNVY AMFWITKAAM PHLKPGAAII NTSSVEAYTP
SDAFLDYAQT KACNVAFTKS LAKQLANKGI RVNAVAPGPF WTPLQTAGWA DLSRLGKETP
LGRPGQPAEL GPLYVFLASQ ESSYATGQVY GASGGEGQP